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ESE GENES AND PROTEINS 

Field of the Invention 

This invention relates to novel mammalian proteins identified as Esel and Ese2 
5 which are involved in endocytosis via clathrin coated pits. 

Background of the Invention 

Endocvto sis via clathrin-coated pits is a multipstep pmr^ (i ?) H athrin h e avy 

and light chains are brought to the plasma membrane through association with a 
1 0 heterotetrameric complex known as clathrin adaptor complex 2, or AP-2. At coated pits, 
the membrane is bent through the assembly of clathrin triskelion into a caged lattice. The 
GTPase Dynamin is also recruited to the neck of coated pits where it assembles into a 
collar for vesicle fission (4, 5). Recruitment of Dynamin to coated pits is believed to 
require the Amphiphysin I/II heterodimer (6), as these proteins bind Dynamin in vivo and 
1 5 amphiphysin proteins are required for endocytosis in yeast (7). In addition, ectopic 

expression of either Amphiphysin I or IT by themselves (6), or the isolated SH3 domain of 
Amphiphysin I blocks endocytosis (8, 9). Recent data has revealed that the Rab5 small 
GTPase is required for sequestration of ligands such as transferrin and Epidermal Growth 
Factor into coated pits in vitro (10). The mechanism by which these components interact 

20 to regulate coated pit assembly, cargo sequestration, followed by vesicle fission is not yet 
understood. 

From biochemical, cell biological and genetic analysis it is clear that additional 
components such as kinases, phosphatases, ubiquitin conjugating enzymes as well as lipid 
modifying enzymes are required for clathrin-coat and vesicle formation (1, 1 1, 12, 13, 14, 

25 15, 16, 17, 18, 19). Indeed there is also strong evidence for a requirement of the actin 

cytoskeleton in endocytosis and several proteins which may facilitate this connection (1,7, 
20,21, 22, 23, 24, 25). 

The Epsl5 protein was discovered in a search for substrates of the Epidermal 
Growth Factor Receptor(26). In 1995, Benmerah et. al. reported that Epsl5 is 

30 constitutively associated with a-adaptin of the AP2 complex (27). The Epsl 5 protein has 
also been localized to the neck of clathrin-coated pits by immunoelectron microscopy (28, 
29). Recently, two groups have used dominant inhibitory mutants of Epsl5, or antibodies 
aglainst Epsl 5 (or the related protein Epsl5R), to demonstrate that Epsl 5 proteins are 
required for endocytosis via clathrin-coated pits (30, 31). Epsl 5 contains three large 
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structural domains(26, 32). The N-terminal third contains three copies of an EH domain 
(for Epsl5 Homology domain) (32, 33). The central region of Epsl5 forms an extended 
coiled-coil, which is followed by a complex C-terminus containing SH3-binding motifs 
(34), a large number of DPF repeats (Aspartic acid-Proline-Phenylalanine) of unknown 
5 function, and a-adaptin binding sequences (31, 35, 36). The full length Epsl5R protein 
has a similar overall organization (34). Both EpslS and EpslSR can be alternatively 
spliced to produce numerous smaller proteins(37). 
i A prntein with similar overall organization has heen identified in Rfir.r.hn rninyaex 

cerevisiae named Panlp. Genetic analysis of PAN I has revealed that this gene is required 

10 fo(r endocytosis and for organization of the actin cytoskeleton (23, 38). Like EpslS and 
Ef)sl5R in mammals, the Panlp protein has N-terminal EH domains followed by a central 
coiled-coil domain and C-terminal proline-rich sequences. A second EH domain 
containing protein, End3p, has also been described in S. cerevisiae which is required for 
endocytosis and regulation of the actin cytoskeleton (21, 39). Co-immunoprecipitation 

15 studies have shown that Panlp and End3p form a complex in vivo (40) . Indeed, 

overexpression of End3p can suppress the phenotype of pan J -4 hypomorphic mutants, 
and Panlp is mislocalized in ends mutants indicating that these proteins function together 
(4j0). Additional studies have revealed that the EH domains of Panlp bind to yeast 
hdmologues of mammalian clathrin-binding proteins, API 80 and CALM (yAP180A and 

20 yAPl 80B), through NPF motifs (Asparagine-Proline-Phenylalanine) in the yAPl 80 C- 

termini(25). These data have led to a proposal that the Panlp:End3p complex functions as 
a multivalent adaptor to coordinate protein-protein interactions during endocytosis (25, 
40). At least two additional proteins are predicted to bind to the Panlp;End3p complex in 
vivo, as strong genetic interactions have been detected between PAN1 and SJL1(25), and 

25 beftween PAN1 and RSP5 (41). SJL1 encodes a phosphatidylinositol polyphosphate-5- 

phosphatase protein which is related to mammalian synaptojanin (42) and has a C-terminal 
NPF motif predicted to bind to EH domains in Panlp (or End3p) (25, 43) . RSP5 encodes 
an E3 ubiquitin-protein ligase which may bind to the C-terminal polyproltne sequences in 
Panlp through one of its three WW domains (25). 

30 Numerous SH3 domain containing proteins have been implicated in the regulation 

of endocytosis (44). These include Amphiphysin 1(45) and II (6, 46, 47, 48), 
Rsvl61/Rsv 167(7), Actin Binding Protein- 1(49), Endophilin/SH3P4/8/13 (50, 51) and 
Gfb2 (52). Kay and coworkers have reported the isolation of several novel SH3 encoding 
cDNAs (53). 
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The present inventors have identified novel mammalian proteins containing both 
EH and SH3 domains, which have been named Esel and Ese2. Sequence and functional 
analysis of the full length proteins have implicated these proteins in both pinocytosis and 
receptor mediated endocytosis via clathrin coated pits and therefore the proteins have been 
5 named Esel and Ese2 respectively (Ese: for EH-domain and SH3 domain regulator of 
Endocytosis). 

Summary of the Invention 

In accordance with one series of embodiments, this invention provides isolated 
10 nucleic acids corresponding to or relating to the nucleic acid sequences disclosed herein 
which encode the murine Esel and Ese2 proteins. 

One of ordinary skill in the art is now able to identify and isolate mammalian Ese 
protein genes or cDNAs which are allelic variants of the disclosed sequences or are 
homologues thereof, in other species, including humans, using standard hybridisation 
1 5 screening or PCR techniques. 

In a further embodiment, the invention provide cDNA sequences encoding murine 
Esel and Ese2 proteins comprising the nucleotide sequences of Sequence ED NOS: 1 , 2, 4 
and 5 

Also provided are portions of the Ese gene sequences useful as probes or PCR 
20 primers or for encoding fragments, functional domains or antigenic determinants of Ese 
proteins. 

The invention also provides portions of the disclosed nucleic acid sequences 
comprising about 10 consecutive nucleotides to nearly the complete disclosed nucleic acid 
sequences. The invention provides isolated nucleic acid sequences comprising sequences 
25 corresponding to at least 10, preferably 15 and more preferably at least 20 consecutive 
nucleotides of the Ese genes as disclosed or enabled herein or their complements. 

In addition, the isolated nucleic acids of the invention include any of the above 
described nucleotide sequences included in a vector. 

In accordance with a further series of embodiments, this invention provides 
30 substantially pure murine Ese proteins, fragments of these proteins and fusion proteins 
including these proteins and fragments. 

In accordance with a further series of embodiments, this invention provides 
substantially pure mutant murine Ese proteins, fragments of these proteins and fusion 
proteins including these mutant proteins and fragments. 
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The proteins, fragments and fusion proteins have utility, as described herein, for the 
preparation of polyclonal and monoclonal antibodies to mammalian Ese proteins, for the 
identification of binding partners of the mammalian Ese proteins and for diagnostic and 
therapeutic methods, as described herein. For these uses, the present invention provides 
substantially pure proteins, polypeptides or derivatives of polypeptides which comprise 
portions of the mammalian Ese amino acid sequences disclosed or enabled herein and 
which may vary from about 4 to 5 amino acids to the complete amino acid sequence of the 
proteins. The invention provides substantially pure proteins or polypeptides comprising 



sequences corresponding to at least 5, preferably at least 10 and more preferably 50 or 100 
10 consecutive amino acids of the mammalian Ese proteins disclosed or enabled herein. 

The proteins of the invention may be isolated and purified by an conventional 
method suitable in relation to the properties revealed by the amino acid sequences of these 
proteins. 

Alternatively, cell lines may be produced which express or over-express the Ese 
1 5 gene products, allowing purification of the proteins for biochemical characterisation, large- 
scale production, antibody production and patient therapy. 

For protein expression, eukaryotic or prokaryotic expression systems may be 
generated in which an Ese gene sequence is introduced into a plasmid or other vector 
which is then introduced into living cells. Constructs in which the Ese cDNA sequence 
20 containing the entire open reading frame is inserted in the correct orientation into an 
expression plasmid may be used for protein expression. Alternatively, portions of the 
sequence may be inserted. Prokaryotic or eukaryotic expression systems allow various 
important functional domains of the proteins to be recovered as fusion proteins and used 
for binding, structural and functional studies and also for the generation of appropriate 
25 antibodies. 

The present invention includes effective fragments, analogues of the Ese proteins 
described herein. "Effective" fragments or analogues retain the activity of the described 
Ese proteins to promote endocytosis via clathrin-coated pits. The term "analogue" 
extends to any functional and/or chemical equivalent of a mammalian Ese protein including 
30 mimetics and includes proteins having one or more conservative amino acid substitutions, 
proteins incorporation unnatural amino acids and proteins having modified side chains. 

In accordance with a further embodiment of the invention, antibodies are enabled 
which bind specifically to the Ese proteins disclosed herein. Polyclonal or monoclonal 
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antibodies may be prepared using conventional methods. Antibodies may also be prepared 
to individual selected domains of the Ese proteins, as described herein. 

In a further embodiment, the invention provides pharmaceutical compositions 
containing an Ese protein or a non-functional mutant Ese protein for the treatment of 
5 mammalian disorders which involve abnormal endocytosis leading to altered cellular 
functioning- Administration of a therapeutically active amount of a pharmaceutical 
composition of the present invention means an amount effective, at dosages and for 

periods of time necessary to achieve the desired result. 

In accordance with a further embodiment, the invention provides a method for 
10 identifying binding partners of the Ese proteins disclosed herein. Such methods in general 
include various assays including those including radiolabelling of the Ese proteins. Other 
methods may include but are not restricted to phage display, affinity purification 
techniques, expression cloning and the yeast 2-hybrid system, as described herein. 

In accordance with a further embodiment of the present invention, is a method for 
1 5 identifying proteins which phosphorylate Ese proteins. Such method includes known 
phosphorylation assays. 

The identification of proteins or peptides interact with or bind to Ese proteins can 
provide the basis for the design of peptide antagonists or agonists of Ese protein function 
or for the design of peptide antagonists or agonists of Ese protein binding partners which 
20 affect Ese protein function. Further, the structure of these peptides determined by 
standard techniques such as protein NMR or x-ray crystallography can provide the 
structural basis for the design of improved small molecule drugs. 

In accordance with a further embodiment, the present invention also provides for 
the production of mouse models or transgenic non-human animal models for the study of 
25 mammalian Ese gene function, for the screening of candidate pharmaceutical compounds, 
for the creation of in vitro mammalian cell cultures which express the Ese proteins or in 
which an Ese gene has been inactivated by knock-out deletion, and for the evaluation of 
potential therapeutic interventions. 

The invention enables a transgenic animal, including a transgenic insect, wherein 
30 the genome of the animal or of an ancestor of the animal has been modified by introduction 
of a transgene comprising mammalian Ese genes under the transcriptional control of tissue 
restricted regulatory elements including the mouse mammary-tumour virus long term 
repeat sequences. 
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Transgenic animals with inappropriate expression of Ese proteins may be examined 
for phenotypic changes, for example abnormal cellular development or abnormal cellular 
signalling and may be used to screen for compounds with potential as pharmaceuticals. 
Compounds which provide reversal of the phenotypic changes are candidates for 
5 development as pharmaceuticals. 

Transgenic animals in accordance with the invention can be created by introducing 
a DNA sequence encoding a selected Ese protein either into embryonic stem (ES) cells of 

a suitable animal such as a mouse, by transfection or microinjection, or into a germ line or 

stem cell by a standard technique of oocyte microinjection. Such methods of producing 
10 animal models are fully described in the literature. 

In accordance with another aspect of the present invention is a method for 
screening a candidate compound for effectiveness as an antagonist of an Ese protein 
comprising: 

(a) providing an assay method for determining the endocytotic regulatory 
1 5 capacity of an Ese protein; and 

(b) determining the endocytotic regulatory capacity of the Ese protein in the 
presence or absence of the candidate compound, wherein a reduced level of 
endocytosis in the presence of the candidate compound indicates antagonist activity 
of the compound. 

20 In accordance with another aspect of the present invention is a method for treating 

in a mammal a disorder associated with an undesired level of endocytotic activity of an Ese 
protein comprising administering to the mammal an effective amount of a substance 
selected from the group consisting of: 

(a) an Ese protein antagonist; 

25 (b) an antibody which binds specifically to an Ese protein; 

(c) an antisense strand comprising a nucleic acid sequence complementary to 
the sequence or fragment of the sequence and capable of hybridizing to the nucleic 
acid sequence encoding an Ese protein; 

(d) an agent which down regulates the expression of the Ese gene encoding for 
30 an Ese protein; 

(e) an antagonist of an Ese protein binding partner; and 

(f) an Ese agonist. 

According to another aspect of the present invention is a method for suppressing in 
a mammal, the abnormal proliferation of a cell capable of being stimulated to proliferate by 
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a growth factor receptor, the method comprising administering to the mammal an effective 
amount of a Ese protein antagonist, an Ese agonist or an antibody which binds specifically 
to an Ese protein. 

According to yet another aspect of the present invention is a method for preventing 
5 viral infection in a mammal, said method comprising administering to the mammal an 

effective amount of an Ese protein antagonist, an Ese agonist or an antibody which binds 
specifically to an Ese protein or an Ese mutant protein not capable of regulating 

endocytosis. 

According to a further aspect of the present invention is a method for causing 
10 endocytosis in selected cells in a mammal in need of such treatment, said method 

comprising administering to the mammal an effective amount of an Ese protein or an active 
analogue, mimic or fragment thereof. 

Other objects, features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, that 
15 the detailed description and the specific examples while indicating preferred embodiments 
of the invention are given by way of illustration only, since various changes and 
modifications within the spirit and scope of the invention will become apparent to those 
skilled in the art from this detailed description. 

20 Summary of Tables and Drawings 

A detailed description of the preferred embodiments are provided herein below 
with reference to the following tables and drawings in which: 

Table 1 shows alignment of mouse Esel and Ese2 protein sequences. EH, coiled- 
coil and SH3 domains are as indicated. 
25 Figure 1 shows a Northern blot demonstrating expression of Esel and Ese2 genes 

in various adult tissues. 

Figure 2 shows Western blots demonstrating the association of Ese and Epsl5 
proteins in vivo. Endogenous Esel proteins were precipitated with Rabbit anti-peptide 
antisera against the N- terminus of Esel. lmmunoprecipitates were then analyzed on 
30 western blots for the presence of Ese 1/2 with Chicken anti-Ese antisera or for co- 
precipitation of EpslS with Rabbit antisera raised against the C-terminus of Epsl5. 

Figure 3 shows association of Ese and Epsl5 C-terminal deletion mutants. Cos-1 
cells were transfected with pcDNA3Epsl5, pcDNA3Esel or the C-terminal deletion 
mptants pcDNA3Ese!5C and pcDNA3Eps!58C-Flag as indicated. Cell lysates were 
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immunoprecipitated with rabbit anti-Esel or rabbit anti-Epsl5 (left panel); rabbit anti- 
Esel, mouse anti-Flag or rabbit anti-Epsl5 (right panel). Panels represent western blots to 
detect the presence of Esel in each immunoprecipitation. A 90kDa Esel protein exists in 
the third and sixth lanes on the left panel which is the C-terminally truncated Esel protein 
which is co-immunoprecipitated in a complex with Epsl5 in the sixth lane. Also, to be 
noted is the precipitation of Esel with anti-Flag monoclonal antibody in the sixth lane of 
the right panel experiment. In this case, Esel has been precipitated in a complex with the 
C-terminally truncated Epsl55C protein. ___ 



Figure 4 shows confocal immunofluorescent micrographs of Cos cells transfected 
10 with myc-Esel (A, A 1 ); EpslS (B); myc-Esel + Epsl5 (C, C\ C'\D) or myc-Esel + 
Epsl58C (E). 

Figure 5 shows confocal immunofluorescent micrographs of Cos cells transfected 
with myc-Esel and probed with antibodies to myc (A), Dynamin (B) or both antibodies 
(C). 

15 Figure 6 shows a proposed model for Ese regulation of Endocytosis. 

In the drawings, preferred embodiments of the invention are illustrated by way of 
example. It is to be expressly understood that the description and drawings are only for 
the purpose of illustration and as an aid to understanding, and are not intended as a 
definition of the limits of the invention. 

20 

Detailed Description of the Invention 

Se quencing of Murine Esel and Ese2 penes 

The full length murine Esel cDNA was sequenced (Sequence ID NO: 1). It 
encodes a sequence of 1213 amino acids (Sequence ID NO:3) having a predicted 

25 molecular weight of 137 kDa. Murine Ese2 cDNA was also sequenced (Sequence ID 
NO:4) and encodes a sequence of 1 197 amino acids (Sequence ID NO: 6) having a 
predicted molecular weight of 135.7 kDa.. 

The full length Ese proteins are predicted to encode two N-terminal EH domains 
followed by a coiled-coil domain and five SH3 domains (Table 1). Several Esel isolated 

30 clones contain only SH3 A , SH3 B and SH3 E domains. Indeed, the SH3 C domain was not 
included in the original human SH3P17 partial cDNA. In addition, expressed sequence 
tags from the Esel gene have been found in the public databases which skip sequences 
encoding individual EH domains or regions of the coiled-coil domain indicating that this 
gene is subject to complex alternative splicing and has the potential to encode for many 
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distinct proteins. The predicted Ese2 protein on the other hand encodes a C-terminal 
extension of 45 amino acids in comparison to the human SH3P1 8 partial cDNA, 
suggesting that this gene is also alternatively spliced. The Ese proteins are 53% identical 
over the full length of Ese 1 (645 of 1213 residues in Esel line up with identical residues in 
5 Ese2) and are related to the Ese protein from Xenopus which has recently been submitted 
to genebank (Accession # AF0321 18). 

The two EH domains of Esel are highly related to the respective EH domains in 
Ese2 and these Ese EH domains are most closely related those found in Epsl5 and 
Epsl 5R, two mammalian protein which are required for endocytosis through clathrin- 

10 coated pits. EH domains have also been identified in End3 and Panlp which are yeast 
partners proteins required for endocytosis. The central third of both Ese proteins are 
predicted to encode an extended coiled-coil which is a domain typically associated with 
protein-protein association through dimerization or tetramerization as noted for Epsl 5 
(29, 54). Besides the homology between Esel and Ese2, the Esel SH3 domains are most 

] 5 closely related to SH3 domains from Myosin IB in Acanthamoeba (SH3 A ), Myosin IB in 
Entamoeba (SH3 B ), the YFR024 hypothetic yeast protein (SH3 C ), Myosin IB from 
Acanthamoeba (SH3 D ) and Myosin IC from Acanthamoeba (SH3 F ,), The same homologies 
are noted for SH3 domains from Ese2 with the exception of SH3 A which is most similar to 
the SH3 domain from pPIX, SH3 C which is most similar to an SH3 domain from the 

20 mouse Ray protein and SH3 D which is most similar to the SH3 domain from Dictyostelium 
myosin EB. Additional sequence analysis reveals the presence of a very large number of 
potential phosphorylation sites in the Ese proteins and a single SH3 binding consensus in 
the N-termini of each Ese protein. 

25 Expression of Ese genes 

In order to determine where the Ese genes are expressed we have performed 
Northern analysis on mRNA derived from several adult mouse tissues (Figure 1). These 
genes are both widely expressed and reveal a complex pattern of alternatively spliced 
transcripts. The highest levels of Esel mRNA were noted in brain, heart and skeletal 

30 muscle. Brain and skeletal muscle mRNA also contain an unusually large transcript which 
may be as much as 15kb in length. Interestingly, these tissues express high levels of many 
proteins involved in synaptic vesicle recycling and endocytosis. A prominent 2.4kb Ese2 
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transcript has also been noted which is significantly smaller than the 3591bp sequence 
required to code for the full Ese2 protein as shown in Table 1 
Subcellular localisation of Ese proteins 

To determine the subcellular localization of Esel, a myc-epitope tagged version of 
5 this protein (mycEsel) was engineered. This tagged protein was expressed in Cos cells 
and detected by confocal immunofluoresence using the mouse anti-myc monoclonal 
antibody 9E10 (Figure 4). Interestingly, the transfected Esel protein is highly concentrated 
into circular domains (Figure 4A and 4C) which are present throughout transfected cells 
(compare single optical section in Figure 4A with the projection of all sections through the 

10 same cell shown in 4A'). Esel overexpressed in 10T1/2, BHK and Hela cells using the 
vaccinia virus T7 expression system is also localized to large circular domains (data not 
shown). This concentration of ectopically expressed Esel contrasts with the localization 
of Epsl5 in transiently transfected cells, EpslS being dispersed throughout the cell (Figure 
4B). Esel and EpslS proteins form a complex in vivo and yet localize to distinct 

1 5 subcellular compartments in transfected Cos cells. We therefore determined the 

localization of both mycEsel and Epsl5 in Cos cells co-transfected with both genes. In 
co-transfected cells, mycEsel is still found in circular domains (Figure 4C). Interestingly, 
the transfected Epsl 5 is now also partially co-localized with Esel in the same circular 
domains (Figure 4C and C"). 

20 The C-terminal third of Epsl 5 contains several regions which are required for 

association with a-adaptin of the AP2 clathrin adaptor complex (31). It has been shown 
that this region of Eps 1 5 is not required for its association with Esel (Figure 4). In order 
to test whether EpslS function may be required for Ese localization, mycEsel and 
EpslSsC have been co-transfected. Interestingly, mycEsel localization is completely 

25 altered in cells co-transfected with EpslSsC (compare Figure 4D and E) indicating that 
Esel requires EpslS function for its distinctive subcellular localization. In addition, these 
data suggest that the Esel:Epsl5 complex must be associated with AP2 in order to form 
the large circular domains in Esel transfected cells. 

The Dynamin GTPase is required for clathrin-coated vesicle fission and is recruited 

30 to coated pits by Amphiphysin I/II heterodimers (6). In order to study the subcellular 
localization of Dynamin in Esel overexpressing cells we have co-stained for the myc 
epitope tag with Rabbit anti-myc antisera and for endogenous Dynamin with mouse anti- 
Dynamin monoclonal antibodies. Surprisingly, endogenous Dynamin is recruited to the 
Esel staining circles and both proteins are co-localized (Figure 5). These results 
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demonstrate that mycEsel -containing circles are endocytic structures and that Esel is a 
rate limiting component of the early endocytic compartment which can function together 
with Epsl5 to induce Dynamin recruitment. 

Identification of binding partners of Ese proteins 

In order to identify Ese partners, a systematic analysis was initiated of each domain 
for binding partners using the yeast two hybrid system. The central coiled-coil domain of 
Esel from amino acid 330 to 732 was fused to the GAL4 DNA binding domain and 
transformed into the Y190 reporter strain together with plasmid cDNA libraries from 
several tissue sources. Yeast colonies were selected for growth on Histidine minus plates 
to select for interaction between library encoded GAL4 activation domain fusions and the 
GAL4 DNA binding domain Esel coiled-coil bait. Yeast colonies which survived 
selection for expression of the His3 gene where also tested for induction of the integrated 
LacZ gene which is GAL4 responsive in Y190. Prey plasmids were recovered from 37 
positive yeast colonies which were selected for expression of both His3 and LacZ. Several 
classes of cDNA were recovered in this screen. One class included Epsl 5 which was 
isolated twice and EpslSR which was isolated from four yeast colonies. Interestingly 
these clones were all partial cDNA fusions which minimally included C-terminal sequences 
from the central coiled-coil domain and downstream. In the case of Epsl 5, the positive 
clones start from amino acid 306 and 376 whereas in EpslSR the N-terminal boundary of 
clones were amino acid 4, 10, 222 and 386. These data indicate that minimal sequences 
required for interaction between the Esel coiled-coil domain and Epsl5(R) include either 
or both of the central coiled-coil and/or the C-terminal third of these proteins, 

The Ese proteins were analyzed in vivo to test for their association with Epsl 5 or 
EpslSR. Polyclonal antisera were raised in chickens against a GST fusion containing the 
C-terminus of Esel from amino acid 665 to the stop codon. This region of Esel contains 
all five of the SH3 domains. In addition, polyclonal antisera were generated in rabbits 
against a peptide representing the first 21 amino acids of Esel . Cell ly sates were prepared 
from A431, PC 12, MDCK and Hela cells which represent cell lines from several distinct 
tissue types and species. The rabbit anti-peptide antisera were used to precipitate Esel 
from each lysate and precipitates were analyzed by western blotting using the chicken anti- 
Ese sera. In each cell line the presence of several specific bands in the range of 1 50kDa 
were observed which were precipitated in the absence but not in the presence of the 
peptide to which the sera was generated (Figure 2). Thus the Esel protein is expressed in 
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many tissue culture cells and runs in a range consistent with the predicted molecular 
weight of 137kDa. The same samples were also analyzed for co-immunoprecipitation of 
Epsl 5 proteins. In each case multiple Epsl 5 proteins were detected which co-purify with 
Esel . The anti-Esel peptide antisera and commercially available anti-Epsl5 antisera 
which were used were raised against epitopes which are not shared by these proteins 
indicating that Esel and Epsl 5 are constitutively associated partners in vivo. This 
association is reminiscent of the previously detected complex between two EH domain 
containing yeast proteins, End3p and Panlp. 



Esel and Epsl 5 proteins both contain central coiled-coil motifs. In addition, Esel 

1 0 contains multiple C-terminal SH3 domains while Epsl 5 contains SH3-binding motifs. In 
order to map the regions of each protein which are required for their association in vivo, 
C-terminal truncations of each (Epsl55C and Esel6C) were generated. Full length Epsl5 
was co-transfected into Cos-1 cells together with either full length Esel or C-terminally 
truncated Esel SC. Cell lysates were precipitated with either rabbit anti-Epsl5 or with 

15 rabbit anti-Esel, and precipitates were western blotted with chicken anti Esel antisera. 
Interestingly, the C-terminally truncated Esel protein was efficiently immunoprecipitated 
in a complex with Epsl 5 using anti-Epsl5 sera. In a reciprocal experiment Cos-1 cells 
were transfected with Esel alone or together with Epsl 5 or C-terminally truncated Epsl 5 
which had been Flag-epitope tagged (Epsl55C-Flag). Cell lysates were prepared and 

20 immunoprecipitated with either rabbit anti-Esel, mouse anti-Flag, or rabbit anti-Epsl5 
antibodies. These immunoprecipitations were also western blotted to analyze for the 
presence of Esel . The anti-Flag antibody efficiently precipitated Esel from cells 
expressing Flag-tagged Epsl55C indicating that the C-terminally truncated Epsl 5 protein 
can bind to Esel in vivo. The Esel and Epsl 5 proteins are therefore associated through 

25 interaction of their central coiled-coil regions and do not require the presence of SH3 and 
SH3 -binding motifs in their respective C-termini. 

In addition to the identification of Epsl 5 and Epsl5R in the yeast two hybrid 
screen with the coiled-coil domain bait of Esel, the following were also identified as Ese 
binding proteins: TSG101 (accession #U52945), meningioma expressed antigens 6/1 1 

30 (accession #U94780 for mea6), p-tropomyosin, rabaptinS (accession #D86066), Adora2a 
(accession #Y 13345), LI lipid binding protein (accession #K02109) and numerous 
cytokeratins and laminins. Novel genes identified in this screen are detailed below. 
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Also performed was a yeast two hybrid screen using amino acids 665-1213 of Esel 
as bait. This screen led to the isolation of the following clones which produced GAL4 
Activation domain fusions which bound to this SH3 domain bait of Esel . The Ese-binding 
proteins identified in our SH3 screen were the cbl-b oncoprotein (accession #U26712), 
Dynamin II (accession #L3 1398), KIAA0268 (accession #D87742), Jerky (accession 
#U35730), hnRNP-K (accession #L29769), SAP49 (accession #L35013). Novel genes 
identified in this screen are detailed below and several novel clones as outlined below. 

These results demonstrate that the novel Ese genes and the proteins that they 

encode function in a complex with EpslS proteins to regulate endocytosis together. In 
addition, this complex contains binding sites for numerous other proteins. Furthermore, 
with the identification of several potential phosphorylation sites on the Ese proteins, these 
results also suggest that whether or not complexed with EpslS, Ese proteins are involved 
in intracellular signalling processes which are likely to lead to altered cellular activity. 
Many Ese partners have been identified in these studies. Also revealed is a novel method 
to identify more Ese partners. Yeast cells have a complex formed by two EH domain 
proteins (Panlp and End3p) which regulates both endocytosis as well as the actin 
cytoskeleton. Indeed, the Epsl5 protein has been reported to regulate both endocytosis 
and the actin cytoskeleton (1). As Epsl5 and Ese function together, and Ese contain many 
protein-protein interaction surfaces on this complex, strongly suggesting that the Ese 
proteins and their binding proteins are critical regulators of Epsl5:Ese functions in vivo. 

Taken together, these studies demonstrate the identification and isolation of two 
novel murine Ese proteins encoded by two novel murine Ese genes which are involved in 
the regulation of endocytosis via clathrin-coated pits. The process of endocytosis is most 
likely to be related to receptor-mediated endocytosis, however, may also be related to 
pinocytosis or non-receptor mediated endocytosis. The novel Ese proteins of the present 
invention appear to function to regulate endocytosis involving the formation of clathrin- 
coated pits by the polymerization of clathrin into a lattice along the cytosolic face of a 
region of membrane causing the region to expand inward. Ultimately, the pit pinches off 
from the membrane, and the clathrin cage is completed thus producing a coated vesicle. 
Through the binding of EsplS to form an Ese-Epsl5 complex via a central binding region, 
the complex can recruit other proteins such as assembly particles and dynamin to bind 
GTP and promote coated vesicle formation and perhaps their transport to appropriate 
locations within the cell and release of internalized proteins and/or molecules. The SH3 
C-terminal domains of the Ese proteins can bind and interact with several other proteins 



13 



CA 02230201 1998-04-27 



leading to a host of protein-protein interactions involved in the endocytotic process. The 
Ese proteins may be activated or inactivated via phosphorylation of the proteins at 
numerous phosphorylation sites by the action of activated receptors on cell surfaces. 
Due to the fact that the Ese proteins appear to be a key central player in the 
5 complex process of endocytosis involving protein-protein interactions and intracellular 
signalling, these proteins are most likely involved in a myriad of clinical conditions and 
processes which are very likely to include but not be limited to regulation of endocytosis 
fas described above), cell division and cancer (Epsl 5 and cbl-b are oncoproteins), cell 
migration (regulation of the actin cytoskeleton is required for many forms of cell 

10 migration), cell polarity, plane of cell division and cell fate specification (Epsl5 binds to 
Numb in vivo which is required for these processes(43), RNA localization (several RNA 
binding proteins have been identified in the present screens) and viral infection and life 
cycle (Epsl5 binds to RAB a cellular cofactor for HIV Rev (43)). 

With respect to viral infection Ese proteins may play an important role and thus 

] 5 may be a target for developing therapeutic strategies against viral infection and virally- 

induced disease states. HIV is known to alter endocytosis of several important cell surface 
molecules including CD4 and MHC antigens. HIV has been shown to bind to NEF which 
has been demonstrated to bind to SH3 sites. This binding to NEF induces clathrin coated 
pit formation. As NEF binds SH3 sites and induces endocytosis, it is possible that Ese 

20 proteins may bind to NEF and are involved in the endocytotic process. Therapeutic 

strategies to provide treatment for viral infection and virally induced disease states may 
therefore include the inhibition of Ese-NEF binding using antagonists, antibodies or other 
agents directed against such a complex to inhibit endocytosis and in this manner inhibit 
viral infection and virally induced disease states. It is also expected that many types of 

25 viruses will interact with the multi-component Epsl 5-Ese complex. 

Synaptic transmission and abnormal or altered synaptic transmission as seen in 
various nervous disorders may also be a target for the therapeutic use of Ese proteins 
and/or antagonists. The Epsl 5:Ese complex has been demonstrated to regulate 
endocytosis, is highly expressed in the brain and Ese binds to Jerky; a protein required to 

30 prevent epilepsy in mice (57). Futhermore, Ese proteins are highly expressed not only in 
the brain but also in the heart and in skeletal muscle which are tissues involving high levels 
of synaptic transmission. This suggests that Ese protein may be used in the treatment of 
nervous system disorders involving altered synaptic transmission. 
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Receptor-mediated cell signaling such as seen with several different types of 
growth factors also involves endocytosis. Over-expression, mutation or over-stimulation 
of growth factor receptors has been demonstrated to lead to abnormal ceil division and 
growth as seen in cancer. For example, EGF is a potent mitogen for many epithelial cells 
5 and EGF receptor activation is known to stimulate intracellular kinase pathways leading to 
cell proliferation. Such activity may play a role in cancer progression.By altering the rate 
of endocytosis by targeting Ese proteins, the cell proliferative effect of growth factor 

receptor stimulation may be counteracted. 

Abnormal cell division and cell migration is seen in several diseases and involves 
10 the cell cytoskeleton. The intracellular cytoskeleton is highly organized and consists of 

microtubules, microfilaments and intermediate filaments acting as an internal reinforcement 
in the cytoplasm of a cell. Together these structures associate in a regular and defined 
manner which is regulated by extracellular signals and may transduce plasma membrane 
signals by association with other proteins or by second messengers. The Ese-Epsl 5 
1 5 complex is very likely to regulate the cytoskeleton by analogy to the role of Panlp:End3p 
in regulation of yeast cytoskeleton. Furthermore, endocytosis is known to involve a 
rearrangement of the intracellular cytoskeleton. Cell division and migration require the 
continual rearrangement of the intracellular cytoskeleton. Therefore, abnormal patterns of 
cell division and migration may involve altered Ese function and altered endocytosis. The 
20 Ese protein or the gene may therefore be used to alter regulation of endocytosis or the 
association of the Ese protein with the cell cytoskeleton to restore cell division and 
migration to normal levels and patterns. 

Tissue development also involves the continual remodeling of the extracellular and 
intracellular cytoskeletal network along with associated proteins Developmental diseases 
25 can occur as a result of abnormal remodeling of the cytoskeleton leading to altered 

intracellular signaling. As Ese proteins are likely to be involved with both the cytoskeleton 
and intracellular signaling they may also be directly involved in the development of certain 
developmental diseases and therefore may be a target for therapeutic treatment of such 
diseases. Ese proteins may also be involved in normal development including that of stem 
30 cells which are self-renewing cells that divide to produce differentiated daughter cells in 
various tissues. As Ese proteins are associated with the cytoskeleton they may play a part 
in the formation of certain types of differentiated cells through the partitioning of RNA and 
proteins such as NuMb during cell division. 
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To summarize, Ese proteins, Ese complexes including Epsl 5, EpslSR and many of 
the proteins identified in the screens as well as others identified through similar screens can 
be targeted for use in therapies to treat diseases including cancer, viral infection based 
diseases, developmental diseases due to altered cell fate specification and/or division as 
5 well as neurological diseases and diseases of altered cell migration and other diseases due 
to defects in the actin cytoskeleton. 

EXAMPLES 

The examples are described for the purposes of illustration and are not intended to 
1 0 limit the scope of the invention. 

Methods of molecular genetics, protein and peptide biochemistry and immunology 
referred to but not explicitly described in this disclosure and examples are reported in the 
scientific literature and are well known to those skilled in the art. 

15 Material and Methods 

Ese cloning and plasmids 

High Stringency screening was used to isolate the two mouse Ese cDNAs by 
previously described methodology (58). Esel was cloned from an adult mouse brain 
cDNA library using a PCR product from nt 1707-2197 of the coding sequence as probe. 

20 Ese2 was cloned from a mixed tissue adult mouse cDNA library using a mixture of three 
probes (EST#583 881 (Research Genetics Inc.), EST#652549 (Research Genetics Inc.) 
and a PCR product from nt 2712 to nt 3456 of the Ese2 coding sequence. The mouse 
Epsl 5 cDNA was generated through a combination of high stringency library screening 
with Est sequences from the Epsl 5 gene and rt PCR according to established methods. 

25 

pcDNA3Esel: 

Full length Esel was cloned into the Notl site of pcDNA3 (lnvitrogen Inc.). The 
Esel cDNA includes 53 nucleotides of 5' UTR plus a natural Notl site and 288 
nucleotides of 3'UTR plus a small region of polylinker including a Notl site. 

30 

pq DNA3mvcEsel : 

The 5' end of pcDNA3Esel from the EcoRI she in the pcDNA3 polylinker to the 
start codon was replaced with the DNA sequence 
gAAT^ArxAACrATO GA^ 
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where the first underline corresponds to an EcoRI site which was fused into the pcDNA3 
EcoRI site and the extended underlined sequence codes for a myc-epitope tag. This is 
followed by nine nucleotides which code for glycine, proline and the natural Esel start codon. 
This sequence was joined to the sequences coding for amino acids 2-1213 (the remainder of 
Esel). The new start codon in this tagged Esel construct is bolded. The 3' end of Esel in 
this vector is the same as in pcDNA3Esel above 

pcDNA3Ese!5C : 

The C-terminus of Esel was removed by replacing all sequences from nt 2209 of 
the coding sequence to the Xhol site in pcDNA3Esel with TGA CTCGAG where the stop 
codon is in bold and the Xhol site is underlined. This construct codes for amino acids 1- 
736 of Esel. 

pCDNA3Eps!5 : 

This plasmid was constructed from four pieces. It contains the full length Epsl 5. 
The 5'UTR of this construct has been constructed to be GGATCCACCATG where a 
BamHI site is underlined and the start codon is bolded. This BamHI site was fused to the 
BamHI site in pcDNA3. The 3'UTR in this vector is 204 nt of the mouse natural 3'UTR 
fused to a short cloning linker ending in the sequence AAGCTTGGGCCC where an Apal 
site is underlined; this Apal site was fused to the Apal site in pcDNA3. 

pcDNA3Epsl56C : 

This vector is the same as pcDNA3Epsl5 except that sequences downstream from 
and including mouse Epsl 5 coding nucleotide 1500 have been replaced with 
CCT GGATTACAAGGATGATGATGACAAA TGACTrQA^ where the first underlined 
sequence codes for the Flag-epitope, an mframe stop codon is bolded and an Xhol site is 
underlined. This Xhol site was fused to the polylinker in pcDNA3. The resulting plasmid 
encodes amino acids 1-501 of mouse Epsl 5 fused to a C-terminal Flag epitope. The 5 s 
end of Epsl 5 in this construct is as indicated above for pcDNA3Epsl 5. 

pGBT9Eselcc : 

The Esel sequence coding for amino acids 330 to 732 were fused directly to 
GAATTC (EcoRI site) on the 5' end and to TA GGATCC (stop codon followed by a 
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BarnHI site) on the 3' end. This fragment was cloned into EcoRl/BamHI digested pGBT9 
in frame with the GAL4 DBD. 

P GBT9EselA3.3 : 

5 This plasmid encodes the bait for our SH3 screen. It encodes all five SH3 domains 

from amino acid 665-1213 and was subcloned into pGBT9 on an EcoRI fragment which 
fuses the Esel SH3 region in frame with the DBD of GAL4. 



Yeast 2-Hvbrid Screening 

10 pGBT9Eselcc was transformed into S. cerevisiae strain Y190 using 45% 

PEG4000, 100 mM LiAc, 10 mM Tris-HCl (pH 7.5) as per standard protocols. Cells were 
plated onto Sc-Trp drop-out media. Single colonies were isolated and expression of bait 
was analyzed by Western Blot using antibodies against the GAL4 DBD. A clone 
expressing the bait fusion was used to inoculate a 100 mL Sc-Trp liquid culture grown 

15 overnight at 30°C Cells were then re-inoculated into YPD at a density of 5xl0 6 cells/mL 
and grown at 30°C until the titer reached 2xl0 7 cells/mL. Cells were pelleted, resuspended 
in 50 mL 100 mM LiAc and incubated for 10 min at 30°C. Once again the cells were 
pelleted, resuspended in 20 mL PL A [3 5% PEG, 100 mM LiAc, 2 mg/mL salmon sperm 
DNA] containing 30 ^g plasmid library (cloned in pAD-GAL4) incubated at 30°C for 30 

20 min., then heat shocked at 42°C for 40 min,, pelleted, resuspended in water and plated 

onto [Sc-Trp-Leu-His+40 mM 3-AT]. Plates were incubated at 30°C until colonies were 
formed. Colonies were picked, patched and grown at 30°C overnight on Whatman filter 
paper laid on top of Sc-Trp-Leu-His plates. Filters were submerged in liquid nitrogen for 
15 seconds then placed on top of blotting paper soaked in Z-buffer + X-gal. p- 

25 galactosidase activity was measured by the appearance of blue colour. Plasrnids from p- 
galactosidase positive colonies were shuttled to bacteria by electroporation and isolated 
for sequencing. 

Northern Blot analysis 
30 A multiple tissue northern blot (Clontech) was prehybridized in 5 mL of 

ExpressHyb Solution (Clontech) at 68°C for 30 min. Probe was added at lxlO 6 cpm/mL 
for 1 hr. The Blot was washed twice [2X SSC, 0.05% SDS] at room temperature, twice 
[0. IX SSC, 0. 1% SDS] at 50°C and then exposed to film overnight. 
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Antibodies and Western Blot analysis 

Western blot analysis was performed according to standard protocols. Briefly, 
cultured cell lines or 48 hours post tranfection Cos-1 cells where washed with PBS and 
5 lysed in one ml of cold lysis buffer (50 mM Hepes pH 7.4, 1 50 mM NaCl, 1 0% glycerol, 
1 % Triton XI 00, 1 mM EGTA, 1 . 5 mM MgCl 2 , 1 0 mM NaF, 1 0 mg ml" 1 aprotinin, 1 mM 
PMSF, 10 mg ml" 1 leupeptin, 1 mM Na3VC>4). Supernatants were clarified by 

centrif u gatinn an d i mmiinnprer . ipitatfiri with specifi c antiserum as indicated. Antigen - 

Antibody complexes where purified on anti-rabbit agarose or anti-mouse agarose (Sigma 

1 0 Chemical Co.) Samples were run on 7.5% PAGE gels and transferred to nitrocellulose 
membranes. Filters were blocked in 5% dry milk powder/0.05% Tween 20/PBS, washed 
in 1% dry milk powder/0.05% Tween 20/PBS, and probed with the appropriate antisera at 
Img/ml in wash buffer (In the case of chicken anti-Esel we used 10u,g/ml to probe 
western blots). Probed filters were further washed, probed again with 1/5000 dilution of 

15 horseradish peroxidase conjugated anti-rabbit IgG, anti-mouse IgG antibodies 

(Amersham), or anti-chicken-IgY (Zymed); washed and signal detected using the 
Enhanced Chemiluminescence detection system as per manufacturers instructions 
(Amersham). 

Rabbit anti-Esel antisera was raised against a peptide of the following sequence; 

20 MAQFPTPFGGSLDVWAITVEE. The antisera was affinity purified over the same 

peptide (Research Genetics). This peptide was also used at 5ng to compete for the 5 jag of 
antibody per immunoprecipitation reaction. Chicken anti-Esel antisera was raised against 
a fusion protein between GST and amino acids 665-1213 of mouse Esel. This sera was 
cleared of antibodies reacting against GST by incubation with glutathione s-transferase on 

25 glutathione agarose beads. 

Immunofluorescence 

For Immunofluorescent staining, Cos-1 cells were typically plated at a density of 
2xl0 5 per 22x50mm coverglass and transfected with 2.5u.g of plasmid using Superfect 
30 (Qiagen Inc.). After 2 hours, the cells were washed with 10%FBS in Iscove's Media and 
fed with fresh 10%FBS in Iscove's Media. Two days later, these cultures where fixed 
with cold methanol for 30 minutes at room temperature. Cultures were washed three times 
10 minutes with Phosphate buffered saline (PBS), blocked for one hour at room 
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temperature with 1%BSA in PBS and then incubated with primary antibody in blocking 
solution for one hour, also at room temperature. Slides were then washed three times 10 
minutes with PBS, incubated with secondary antibody/1% BSA/PBS in the dark for one 
hour at room temperature. Finally, slides were washed three times 1 0 minutes in the dark 
at room temperature and mounted using Dabco anti-fade solution (Sigma Chemical Co.). 
Slides were analyzed on the confocal microscope using the 63x objective and optical filters 
to separate signals on each channel. 

For primary antibodies Mouse anti-myc monoclonal 9E10 (10|j.g/ml: Santa Cruz 



Biotech ), Rabbit anti-myc (5[ig/mi: Upstate Biotechnology Inc.), Rabbit anti-Epsl5 
10 antisera #C20 (1 ng/ml: Santa Cruz Biotech), Rabbit anti-Flag epitope antisera (5|j.g/ml: 

Zymed) and mouse anti-Dynamin I #D25520, which recognizes Dynamin in Cos-1 cells by 
both immunoprecipitation and western blotting (data not shown) (20ng/ml: Transduction 
labs, Inc.) were used. As secondary antibodies we used FITC-labelled goat anti-mouse 
(1:80 dilution) and Texas Red-labelled goat anti-rabbit antibodies (1:100 dilution) 
1 5 (Jackson Immuno Research Laboratories Inc.). 
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Mouse Esel cDNA - Sequence ID NO: 1 

CGGC AC GAGGAGG AGT GGAGCGGC GC GGG AGGGCGCGC AGCTTGGTTGCTCC 

GTAGTACGGCGGCTCGCAAGGGAGCATCCCGAGCGGGCTCCGGGACGGCCGG 

GAGGCAGGCAGGCGGGCGGGCGGGGATGGTGTGCGCGGCTGCGGACTCGGCG 

TTCCTCGCGCGGCGTGCGGGCTGCACTGATTTGTGTGAGGGGCGGCCGCGCGC 

ACCCGCCCGGAGATGAGGCGTCGATCAGCAAGGTGAACGTAATAGAACCATG 

GCTCAGTTTCCCACACCTTTCGGTGGTAGCCTGGATGTCTGGGCCATAACTGT 

GGAGGAAAGGGCCAAGCATGACCAGCAGTTCCTTAGCCTGAAGCCGATAGCG 

GGATTTATTACTGGTGATCAAGCGAGGAACTTTTTTTTCCAATCTGGGTTACCT 

CAGCCTGTCTTAGCACAAATATGGGCGCTAGCGGACATGAATAACGATGGAA 

GGATGGATCAAGTGGAATTTTCCATAGCCATGAAGCTTATCAAACTGAAGCTA 

CAAGGATATCAGCTCCCCTCCACACTTCCCCCTGTCATGAAACAGCAACCAGT 

GGCTATTTCCAGTGCACCAGCATTTGGTATAGGAGGGATTGCTAGCATGCCAC 

CACTCACAGCTGTTGCTCCTGTGCCAATGGGCTCCATTCCAGTTGTTGGAATGT 

CTCCACCCTTAGTATCTTCTGTCCCTCCAGCAGCAGTGCCTCCCCTGGCTAACG 

GGGCTCCTCCCGTCATACAGCCTCTGCCTGCGTTTGCGCATCCTGCAGCCACAT 

GGCCAAAGAGTTCTTCCTTCAGCAGATCTGGTCCAGGGTCACAATTAAACACT 

AAGTTACAGAAGGCACAATCATTCGATGTCGCCAGCGCCCCTCCAGCAGCAGA 

ATGGGCTGTGCCTCAGTCATCAAGGCTGAAATACAGGCAGTTATTCAACAGCC 

ACGACAAAACTATGAGTGGACACTTAACAGGTCCCCAGGCAAGAACTATTCTC 

ATGCAATCAAGTTTACCCCAGGCTCAGCTGGCTTCAATATGGAATCTTTCTGA 

CATTGATCAAGATGGAAAACTCACTGCAGAAGAATTTATCCTAGCTATGCACC 

TAATTGATGTTGCCATGTCTGGTCAGCCACTGCCGCCCGTCCTGCCTCCAGAAT 

ACATCCCTCCTTCCTTCAGAAGAGTTCGCTCCGGCAGTGGGATGTCCGTCATA 

AGCTCTTCTTCTGTGGATCAGAGGCTGCCTGAGGAGCCGTCGTCAGAGGATGA 

GCAGCAGCCAGAGAAGAAACTGCCTGTGACATTTGAAGATAAGAAGCGGGAG 

AACTTCGAGCGAGGCAGTGTGGAGCTGGAGAAGCGCCGCCAAGCGCTCTTGG 

AGCAGCAGCGCAAAGAGCAGGAGCGGTTGGCTCAGCTGGAGCGCGCCGAGCA 

GGAGAGGAAAGAGCGGGAGCGCCAGGAGCAGGAGGCCAAGCGGCAGCTGGA 

GCTGGAGAAGCAGCTGGAGAAGCAGCGGGAGCTGGAGCGGCAGCGAGAGGA 

GGAGAGGAGGAAGGAGATCGAGAGGCGCGAGGCCGCAAAACGGGAACTGGA 

AAGGCAGCGACAACTTGAATGGGAACGGAACCGGAGACAGGAACTCCTGAAT 

CAGAGGAACAAGGAGCAGGAGGGCACCGTGGTCCTGAAGGCAAGGAGGAAG 
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ACTCTGGAGTTTGAGTTAGAAGCTCTGAATGACAAAAAGCATCAGCTAGAAGG 
AAAACTTCAGGATATCAGGTGTCGACTGGCAACCCAGAGGCAAGAAATTGAG 
AGCACGAACAAGTCTAGAGAGCTAAGAATTGCTGAAATCACCCACTTACAGCA 
GCAGTTGCAGGAATCTCAGCAAATGCTTGGAAGACTTATTCCAGAGAAACAGA 
5 TACTCAGTGACCAGTTAAAACAAGTCCAGCAGAACAGTTTGCATAGAGACTCG 
CTTCTTACCCTCAAAAGAGCCTTGGAAGCAAAGGAGCTGGCCCGGCAGCAGCT 
CCGGGAGCAGCTGGACGAGGTGGAGAGAGAGACCAGGTCAAAGCTGCAGGA 
GATTGATGTTTTCAACAACCAGCTGAAGGAACTGAGAGAGATACATAGCAAAC 
AGCAACTCCAGAAGCAGAGGTCCCTGGAGGCAGCGCGACTGAAGCAGAAAGA 
1 0 GCAGGAGAGGAAGAGCCTGGAGTTAGAGAAGCAAAAGGAAGACGCTCAGAG 
ACGAGTTCAGGAAAGGGACAAGCAATGGCTGGAGCATGTGCAGCAGGAGGAG 
CAGCCACGCCCCCGGAAACCCCACGAGGAGGACAGACTGAAGAGGGAAGACA 
GTGTCAGGAAGAAGGAGGCGGAAGAGAGAGCCAAGCCGGAAATGCAAGACA 
AGCAGAGTCGGCTTTTCCATCCGCATCAGGAGCCAGCTAAGCTGGCCACCCAG 

15 GCACCCTGGTCTACCACAGAGAAAGGCCCGCTTACCATTTCTGCACAGGAGAG 
TGTAAAAGTGGTATATTACCGAGCGCTGTACCCCTTTGAATCCAGAAGTCACG 
ATGAGATCACCATCCAGCCAGGAGATATAGTCATGGTGGATGAAAGCCAGAC 
TGGAGAGCCAGGATGGCTTGGAGGAGAGCTGAAAGGGAAGACGGGATGGTTC 
CCTGCAAACTATGCAGAAAAGATTCCAGAAAATGAGGTTCCCACTCCAGCCAA 

20 ACCAGTGACCGATCTGACATCTGCCCCTGCCCCCAAACTGGCTCTGCGTGAGA 
CCCCTGCTCCTTTGCCAGTGACCTCTTCTGAGCCCTCCACAACCCCCAACAACT 
GGGCAGACTTCAGTTCCACGTGGCCCAGCAGCTCAAACGAGAAGCCAGAAAC 
GGACAACTGGGATACGTGGGCGGCTCAGCCTTCTCTGACCGTACCTAGTGCTG 
GCCAGTTACGGCAGAGATCAGCCTTTACCCCAGCCACAGCCACTGGCTCCTCC 

25 CCATCTCCCGTCCTGGGCCAGGGTGAAAAGGTGGAAGGGCTACAAGCGCAAG 
CCCTGTATCCCTGGAGAGCCAAAAAAGACAACCACTTAAATTTTAACAAAAGT 
GACGTCATCACCGTTCTGGAACAGCAAGACATGTGGTGGTTTGGAGAAGTTCA 
AGGTCAGAAGGGTTGGTTCCCCAAGTCTTACGTGAAACTCATTTCAGGGCCCG 
TAAGGAAATCCACAAGCATCGATACTGGCCCTACTGAAAGTCCTGCTAGTCTA 

30 AAGAGAGTGGCTTCCCCGGCCGCCAAGCCAGCCATTCCCGGAGAAGAGTTTAT 
TGCCATGTACACATACGAGAGTTCTGAGCAAGGAGATTTAACCTTTCAGCAAG 
GGGATGTGATTGTGGTTACCAAGAAAGATGGTGACTGGTGGACGGGAACGGT 
GGGCGACAAGTCCGGAGTCTTCCCTTCTAACTATGTGAGGCTTAAAGATTCAG 
AGGGCTCTGGAACTGCTGGGAAAACAGGGAGTTTAGGAAAAAAACCTGAAAT 
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TGCCCAGGTTATTGCTTCCTACGCTGCTACTGGTCCCGAACAACTCACCCTGGC 
TCCTGGGCAGCTGATTCTGATCCGGAAAAAGAACCCAGGTGGATGGTGGGAA 
GGAGAACTGCAAGCTCGAGGGAAAAAGCGCCAGATAGGGTGGTTTCCAGCAA 
ATTATGTCAAACTTCTAAGCCCCGGAACAAGCAAAATCACCCCAACTGAGCTA 
5 CCCAAGACCGCAGTGCAGCCAGCAGTGTGCCAGGTGATCGGGATGTACGATT 
ACACCGCCCAGAACGATGACGAACTAGCCTTCAGCAAAGGCCAGATCATCAAC 
GTCCTCAACAAGGAGGACCCGGACTGGTGGAAAGGAGAAGTCAGTGGGCAAG 
TTGGGCTCTTCCCATCCAATTATGTAAAGCTGACCACAGACATGGACCCCAGC 
CAGCAATGAATCATATGTTGTCCATCCCCCCCTCAGGCTTGAAAGTCCTCAAA 

1 0 GAGACCCACTATCCCATATCACTGCCCAGAGGGATGATGGGAGATGCAGCCTT 
GATCATGTGACTTGCAGCATGATCACCTACTGCCTTCTGAGTAGAAGAACTCA 
CTGCAGAGCAGTTTACCTCATTTGACCTTAGTTGCATGTGATCGAAATGTCTG 
AGTCACTGCGTGCAGAGGCAGAAGCAAATTGCAGAACTGCACAGGGTGGTGG 
GTCCTTTTGGGGCTTTCCTAGTCACTCAGACTGACCGGCCCCGCCTTCACACG 

1 5 GGCGCTTTCAATAGTTTTAAGATTATTTTTAAATGTGTATTTTAGCCTTTTAAT 
AAAAATCTCAATCAATTACTTCTTTGCCTATTTTGGTTTTACAAAAACACCCAC 
TATCAAGGAGTGCCTGTCTGCGGACGATTAAAATGCTGTTCCGGGCGTACCGT 
AAACTGAGAGCTTGCTGTACCTTTGCCGTTTGTCCAGTGTTCCCAACCACATTG 
TGTAGTTTGGGGCTGTTCCCTGCCGTAGAGCACAGAGGAGATGGGTGTACCTG 

20 TTTTGAAAATGTGTATGTAGACTGAGCCTGACTATGGAAGGGGTTATGCTTGT 
CTGTGACCATCACGTGTACCTGTCGCGCATGTACCATCTGTACCGAAGAAGTA 
GCTCTTCCTCCATGGCTAAACCCACCACCGTGTACAGTGCTCTCATCTACTGCA 
TTCATTTTACTTTGCACAGTGACCTTGTAGCCACCTGAGGAAGCACCCATGTTT 
CCGTTTGGTCTCAGATGTACCTAGTTGTGCCCGTGTTTTGTTTTTATTTTTCAA 

2 5 TCTGGCATGTCTTC AC ACCATAAACTAGTAAGACGCC AACTGCCCAGGCGGTT 

ACGATCATCAGTACCCACCGTCTTAGTCTCTGTTACGTGAAGTTTATTCCAGTT 
GCTTTTTATGGAATATCTTGAACAAGTAATCTTCTTGACAAGAAAGAATGTAT 
AGAAGTCTCCCTGCAATTAATTTCCCAGTGTTTACATTTTTTAACTAGACTGTG 
GGGGTTGCTACAGATTAATATGAAATGGCGCTCCTGGTCCGTGTGTGTGTTAA 

3 0 CTTGTGCTGT AGCTGAAGCCGTGTGTCCTTAGATATTAGTTGGAAGTCGGGAA 

GAGAATTCGATATCAAGCTT 
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Mouse Esel coding sequence - Sequence ID NO:2 

ATGGCTCAGTTTCCCACACCTTTCGGTGGTAGCCTGGATGTCTGGGCCATAAC 
5 TGTGGAGGAAAGGGCCAAGCATGACCAGCAGTTCCTTAGCCTGAAGCCGATA 
GCGGGATTTATTACTGGTGATCAAGCGAGGAACTTTTTTTTCCAATCTGGGTT 
ACCTCAGCCTGTCTTAGCACAAATATGGGCGCTAGCGGACATGAATAACGATG 
GAAGGATGGATCAAGTGGAATTTTCCATAGCCATGAAGCTTATCAAACTGAAG 
CTACAACKiATATCAGCTCCCCTC^^ 

10 AGTGGCTATTTCCAGTGCACCAGCATTTGGTATAGGAGGGATTGCTAGCATGC 
CACCACTCACAGCTGTTGCTCCTGTGCCAATGGGCTCCATTCCAGTTGTTGGA 
ATGTCTCCACCCTTAGTATCTTCTGTCCCTCCAGCAGCAGTGCCTCCCCTGGCT 
AACGGGGCTCCTCCCGTCATACAGCCTCTGCCTGCGTTTGCGCATCCTGCAGC 
CACATGGCCAAAGAGTTCTTCCTTCAGCAGATCTGGTCCAGGGTCACAATTAA 

15 ACACTAAGTTACAGAAGGCACAATCATTCGATGTCGCCAGCGCCCCTCCAGCA 
GCAGAATGGGCTGTGCCTCAGTCATCAAGGCTGAAATACAGGCAGTTATTCAA 
CAGCCACGACAAAACTATGAGTGGACACTTAACAGGTCCCCAGGCAAGAACT 
ATTCTCATGCAATCAAGTTTACCCCAGGCTCAGCTGGCTTCAATATGGAATCTT 
TCTGACATTGATCAAGATGGAAAACTCACTGCAGAAGAATTTATCCTAGCTAT 

20 GCACCTAATTGATGTTGCCATGTCTGGTCAGCCACTGCCGCCCGTCCTGCCTCC 
AGAATACATCCCTCCTTCCTTCAGAAGAGTTCGCTCCGGCAGTGGGATGTCCG 
TCATAAGCTCTTCTTCTGTGGATCAGAGGCTGCCTGAGGAGCCGTCGTCAGAG 
GATGAGCAGCAGCCAGAGAAGAAACTGCCTGTGACATTTGAAGATAAGAAGC 
GGGAGAACTTCGAGCGAGGCAGTGTGGAGCTGGAGAAGCGCCGCCAAGCGCT 

25 CTTGGAGCAGCAGCGCAAAGAGCAGGAGCGGTTGGCTCAGCTGGAGCGCGCC 
GAGCAGGAGAGGAAAGAGCGGGAGCGCCAGGAGCAGGAGGCCAAGCGGCAG 
CTGGAGCTGGAGAAGCAGCTGGAGAAGCAGCGGGAGCTGGAGCGGCAGCGA 
GAGGAGGAGAGGAGGAAGGAGATCGAGAGGCGCGAGGCCGCAAAACGGGAA 
CTGGAAAGGCAGCGACAACTTGAATGGGAACGGAACCGGAGACAGGAACTCC 

30 TGAATCAGAGGAACAAGGAGCAGGAGGGCACCGTGGTCCTGAAGGCAAGGAG 
GAAGACTCTGGAGTTTGAGTTAGAAGCTCTGAATGACAAAAAGCATCAGCTAG 
AAGGAAAACTTCAGGATATCAGGTGTCGACTGGCAACCCAGAGGCAAGAAAT 
TGAGAGCACGAACAAGTCTAGAGAGCTAAGAATTGCTGAAATCACCCACTTAC 
AGCAGCAGTTGCAGGAATCTCAGCAAATGCTTGGAAGACTTATTCCAGAGAAA 
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CAGATACTCAGTGACCAGTTAAAACAAGTCCAGCAGAACAGTTTGCATAGAGA 
CTCGCTTCTTACCCTCAAAAGAGCCTTGGAAGCAAAGGAGCTGGCCCGGCAGC 
AGCTCCGGGAGCAGCTGGACGAGGTGGAGAGAGAGACCAGGTCAAAGCTGCA 
GGAGATTGATGTTTTCAACAACCAGCTGAAGGAACTGAGAGAGATACATAGC 
5 AAACAGCAACTCCAGAAGCAGAGGTCCCTGGAGGCAGCGCGACTGAAGCAGA 
AAGAGCAGGAGAGGAAGAGCCTGGAGTTAGAGAAGCAAAAGGAAGACGCTC 
AGAGACGAGTTCAGGAAAGGGACAAGCAATGGCTGGAGCATGTGCAGCAGGA 
GGAGCAGCCACGCCCCCGGAAACCCCACGAGGAGGACAGACTGAAGAGGGAA 
GACAGTGTCAGGAAGAAGGAGGCGGAAGAGAGAGCCAAGCCGGAAATGCAA 
1 0 GACAAGCAGAGTCGGCTTTTCCATCCGCATCAGGAGCCAGCTAAGCTGGCC AC 
CCAGGCACCCTGGTCTACCACAGAGAAAGGCCCGCTTACCATTTCTGCACAGG 
AGAGTGTAAAAGTGGTATATTACCGAGCGCTGTACCCCTTTGAATCCAGAAGT 
CACGATGAGATCACCATCCAGCCAGGAGATATAGTCATGGTGGATGAAAGCC 
AGACTGGAGAGCCAGGATGGCTTGGAGGAGAGCTGAAAGGGAAGACGGGAT 
1 5 GGTTCCCTGCAAACTATGCAGAAAAGATTCCAGAAAATGAGGTTCCCACTCC A 
GCCAAACCAGTGACCGATCTGACATCTGCCCCTGCCCCCAAACTGGCTCTGCG 
TGAGACCCCTGCTCCTTTGCCAGTGACCTCTTCTGAGCCCTCCACAACCCCCAA 
CAACTGGGCAGACTTCAGTTCCACGTGGCCCAGCAGCTCAAACGAGAAGCCA 
GAAACGGACAACTGGGATACGTGGGCGGCTCAGCCTTCTCTGACCGTACCTAG 
20 TGCTGGCCAGTTACGGCAGAGATCAGCCTTTACCCCAGCCACAGCCACTGGCT 
CCTCCCCATCTCCCGTCCTGGGCCAGGGTGAAAAGGTGGAAGGGCTACAAGCG 
CAAGCCCTGTATCCCTGGAGAGCCAAAAAAGACAACCACTTAAATTTTAACAA 
AAGTGACGTCATCACCGTTCTGGAACAGCAAGACATGTGGTGGTTTGGAGAAG 
TTCAAGGTCAGAAGGGTTGGTTCCCCAAGTCTTACGTGAAACTCATTTCAGGG 
25 CCCGTAAGGAAATCCACAAGCATCGATACTGGCCCTACTGAAAGTCCTGCTAG 
TCTAAAGAGAGTGGCTTCCCCGGCCGCCAAGCCAGCCATTCCCGGAGAAGAGT 
TTATTGCCATGTACACATACGAGAGTTCTGAGCAAGGAGATTTAACCTTTCAG 
CAAGGGGATGTGATTGTGGTTACCAAGAAAGATGGTGACTGGTGGACGGGAA 
CGGTGGGCGACAAGTCCGGAGTCTTCCCTTCTAACTATGTGAGGCTTAAAGAT 
3 0 TC AGAGGGCTCTGGAACTGCTGGGAAAAC AGGGAGTTTAGGAAAAAAACCTG 
AAATTGCCCAGGTTATTGCTTCCTACGCTGCTACTGGTCCCGAACAACTCACCC 
TGGCTCCTGGGCAGCTGATTCTGATCCGGAAAAAGAACCCAGGTGGATGGTG 
GG AAGGAGAACTGC A AGCTCGAGGG AAAAAGCGCC AG AT A GGGTGGTTTCC A 
GCAAATTATGTCAAACTTCTAAGCCCCGGAACAAGCAAAATCACCCCAACTGA 
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GCTACCCAAGACCGCAGTGCAGCCAGCAGTGTGCCAGGTGATCGGGATGTAC 
GATTACACCGCCCAGAACGATGACGAACTAGCCTTCAGCAAAGGCCAGATCAT 
CAACGTCCTCAACAAGGAGGACCCGGACTGGTGGAAAGGAGAAGTCAGTGGG 
CAAGTTGGGCTCTTCCCATCCAATTATGTAAAGCTGACCACAGACATGGACCC 
5 CAGCCAGCAATGAATCATATGTTGTCCATCCCCCCCTCAGGCTTGAAAGTCCT 
CAAAGAGACCCACTATCCCATATCACTGCCCAGAGGGATGA 

Mouse Esel protein - Sequence ID NO:3 

1 0 MAQFPTPFGGSLDVWAITVBERAKHDQQFLSLKP1AGFITGDQARNFFFQSGLPQP 
VLAQIWALADMNNDGRMDQVEFSIAMKL S 
SAPAFGIGGIASMPPLTAVAPVPMGSEPVVGMSPPLVSSVPPAAVPPLANGAPPVIQ 
PLPAFAHPAATWPKSSSFSRSGPGSQLNTKLQKAQSFDVASAPPAAEWAVPQSSR 
LKYRQLFNSFTOKTMSGFILTGPQARTn.MQSSLPQAQLASrWNLSDIDQDGKLTAE 

1 5 EFILAMHLIDVAMSGQPLPPVLPPEYtPPSFRRVRSGSGMSVISSSSVDQRLPEEPSS 
EDEQQPEKKLPVTFEDKKRENFERGSVELEKRRQALLEQQRKEQERLAQLERAEQ 
ERKERERQEQEAKRQLELEKQLEKQRELERQREEERRKEIERREAAKRELERQRQ 
LEWERNRRQELLNQRNKEQEGTVVLKARRKTLEFELEALNDKKHQLEGKLQDIR 
CRLATQRQEIESTNKSRELRIAEITHLQQQLQESQQMLGRLIPEKQILSDQLKQVQ 

20 QNSLHRDSLLTLKRALEAKELARQQLREQLDEVERETRSKLQEIDVFNNQLKELR 
EIHSKQQLQKORSLEAARLKQKEQERKSLELEKQKEDAQRRVQERDKQWLEHVQ 
QEEQPRPRKPHEEDRLKREDSVRKKEAEERAKPEMQDKQSRLFHPHQEPAKLAT 
QAPWSTTEKGPLTISAQESVKVVYYRALYPFESRSHDEITIQPGDIVMVDESQTGE 
PGWLGK3ELKGKTGWFPANYAEKIPENEVPTPAKPVTDLTSAPAPKLALRETPAPL 

25 PVTSSEPSTTPNNWADFSSTWPSSSNEKPETDNWDTWAAQPSLTVPSAGQLRQRS 
AFTP AT ATGS SP SP VLGQGEKVEGLQ AQALYPWRAKKDNHLNFNKSD VITVLEQ 
QDMWWFGEVQGQKGWFPKSYVKLISGPVRKSTSIDTGPTESPASLKRVASPAAKP 
AIPGEEFIAMYTYESSEQGDLTFQQGDVIVVrKKDGDWWTGTVGDKSGVFPSNY 
VRLKDSEGSGTAGKTGSLGKKPE1AQVIASYAATGPEQLTLAPGQLILIRKKNPGG 

30 WWEGELQARGKKRQIGWFPANYVKLLSPGTSKITPTE 

LPKTAVQPAVCQVIGMYDYTAQNDDELAFSKGQITNVLNKEDPDWWKGEVSGQ 
VGLFPSNYVKLTTDMDPSQQ 



33 



CA 02230201 1998-04-27 



Mouse Ese2 cDNA - Sequence ID NO:4 

CCCTTCCTTTCCTTTTTTTGTGTTCGCCTTCGGCCGTGCCGGCTGAGAGCCCAG 
5 CAGCCGTGACAGGCTGCGCAACAGGTTCGCTGCGGCCGGCCTGACGACTGAC 
CCGGCGGCGGCGGCCGCGGCACGGCAGGGTCTTCCCGGAGCTTGGCCGCGCC 
CACGCGCCGGTGTCGAGGAGCGCGCGGGGTCGCGCCGGGACGTGCGCGAGGC 
GCCAGATGGCTGAGAGCTGCAAGAAGAAGTCAGGATCATGATGGCTCAGTTT 
CCCACAGCGATGAATGGAGGGCCAAATATGTGGGCTATTACATCTGAAGAAC 

1 0 GTACTAAGCATGATAAACAGTTTGATAACCTCAAACCTTCAGGAGGTTACATA 
ACAGGTGATCAAGCCCGTACTTTTTTCCTACAGTCAGGTCTGCCGGCCCCGGT 
TTTAGCTGAAATATGGGCCTTATCAGATCTGAACAAGGATGGGAAGATGGACC 
AGCAAGAGTTCTCTATAGCTATGAAACTCATCAAGTTAAAGTTGCAGGGCCAA 
CAGCTGCCTGTAGTCCTCCCTCCTATCATGAAACAACCCCCTATGTTCTCTCCA 

1 5 CTAATCTCTGCTCGTTTTGGGATGGGAAGCATGCCCAATCTGTCCATTCATCAG 
CCATTGCCTCCAGTTGCACCTATAGCAACACCCTTGTCTTCTGCTACGTCAGGG 
ACCAGTATTCCTCCCCTAATGATGCCTGCTCCCCTAGTGCCTTCTGTTAGTACA 
TCCTCATTACCAAATGGAACTGCCAGTCTCATTCAGCCTTTATCCATTCCTTAT 
TCTTCTTCAACATTGCCTCATGCATCATCTTACAGCCTGATGATGGGAGGATTT 

20 GGTGGTGCTAGTATCCAGAAGGCCCAGTCTCTGATTGATTTAGGATCTAGTAG 
CTCAACTTCCTCAACTGCTTCCCTCTCAGGGAACTCACCTAAGACAGGGACCT 
CAGAGTGGGCAGTTCCTCAGCCTTCAAGATTAAAGTATCGGCAAAAATTTAAT 
AGTCTAGACAAAGGCATGAGCGGATACCTCTCAGGTTTTCAAGCTAGAAATGC 
CCTTCTTCAGTCAAATCTCTCTCAAACTCAGCTAGCTACTATTTGGACTCTGGC 

25 TGACATCGATGGTGACGGACAGTTGAAAGCTGAAGAATTTATTCTGGCGATGC 
ACCTCACTGACATGGCCAAAGCTGGACAGCCACTACCACTGACGTTGCCTCCC 
GAGCTTGTCCCTCCATCTTTCAGAGGGGGAAAGCAAGTTGATTCTGTTAATGG 
AACTCTGCCTTCATATCAGAAAACACAAGAAGAAGAGCCTCAGAAGAAACTGC 
CAGTTACTTTTGAGGACAAACGGAAAGCCAACTATGAACGAGGAAACATGGA 

30 GCTGGAGAAGCGACGCCAAGTGTTGATGGAGCAGCAGCAGAGGGAGGCTGAA 
CGCAAAGCCCAGAAAGAGAAGGAAGAGTGGGAGCGGAAACAGAGAGAACTG 
CAAGAGCAAGAATGGAAGAAGCAGCTGGAGTTGGAGAAACGCTTGGAGAAAC 
AGAGAGAGCTGGAGAGACAGCGGGAGGAAGAGAGGAGAAAGGAGATAGAAA 
GACGAGAGGCAGCAAAACAGGAGCTTGAGAGACAACGCCGTTTAGAATGGGA 
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AAGACTCCGTCGGCAGGAGCTGCTCAGTCAGAAGACCAGGGAACAAGAAGAC 
ATTGTCAGGCTGAGCTCCAGAAAGAAAAGTCTCCACCTGGAACTGGAAGCAGT 
GAATGGAAAACATCAGCAGATCTCAGGCAGACTACAAGATGTCCAAATCAGA 
AAGCAAACACAAAAGACTGAGCTAGAAGTTTTGGATAAACAGTGTGACCTGG 
5 AAATTATGGAAATCAAACAACTTCAACAAGAGCTTAAGGAATATCAAAATAAG 
CTTATCTATCTGGTCCCTGAGAAGCAGCTATTAAACGAAAGAATTAAAAACAT 
GCAGCTCAGTAACACACCTGATTCAGGGATCAGTTTACTTCATAAAAAGTCAT 
CAGAAAAGGAAGAATTATGCCAAAGACTTAAAGAACAATTAGATGCTCTTGA 
AAAAGAAACTGCATCTAAGCTCTCAGAAATGGATTCATTTAACAATCAGCTGA 

1 0 AGGAACTCAGAGAAAGCTATAATACACAGCAGTTAGCCCTTGAACAACTTCAT 
AAAATCAAACGTGACAAATTGAAGGAAATCGAAAGAAAAAGATTAGAGCAAA 
TTCAAAAAAAGAAACTAGAAGATGAGGCTGCAAGGAAAGCAAAGCAAGGAAA 
AGAAAACTTGTGGAGAGAAAGTATTAGAAAGGAAGAAGAGGAAAAGCAAAA 
ACGACTCCAGGAAGAAAAGTCACAGGACAAAACTCAAGAAGAGGAACGAAAA 

1 5 GCTGAGGCAAAACAAAGTGAGACAGCCAGTGCTTTGGTGAATTACAGAGCAC 
TGTACCCTTTTGAAGCAAGAAACCATGATGAGATGAGTTTTAGTTCTGGGGAT 
ATAATTCAGGTTGATGAAAAAACTGTAGGAGAGCCTGGTTGGCTTTATGGTAG 
TTTTCAGGGAAAGTTTGGCTGGTTCCCCTGCAACTATGTAGAAAAAGTGCTGT 
CAAGTGAAAAAGCTCTGTCTCCTAAGAAGGCCTTACTTCCTCCTACAGTGTCT 

20 CTCTCTGCTACCTCAACTTCTTCCCAGCCACCAGCATCAGTGACTGATTATCAC 
AATGTATCCTTCTCAAACCTTACTGTTAATACAACATGGCAGCAGAAGTCAGC 
TTTTACCCGCACTGTGTCCCCTGGATCTGTGTCCCCCATTCACGGACAGGGGC 
AGGCTGTAGAAAACCTGAAAGCCCAGGCCCTTTGTTCCTGGACGGCAAAGAA 
GGAGAACCACCTGAACTTCTCAAAGCACGACGTCATCACTGTCCTGGAGCAGC 

25 AGGAAAACTGGTGGTTTGGGGAGGTGCACGGAGGAAGAGGATGGTTCCCCAA 
GTCTTATGTCAAGCTCATTCCTGGGAATGAAGTACAGCGAGGAGAGCCAGAA 
GCTTTGTATGCAGCTGTGACTAAGAAACCTACCTCCACAGCCTATCCAGTTAC 
CTCCACAGCCTATCCAGTTGGAGAAGACTACATTGCACTTTATTCATACTCAA 
GTGTAGAGCCCGGGGATTTGACTTTCACTGAAGGTGAAGAAATTCTAGTGACC 

3 0 CAGAAAGATGGAGAGTGGTGG ACAGGAAGTATTGGAGAGAGAACTGGAATCT 
TCCCGTCCAACTACGTCAGACCAAAGGATCAAGAGAATTTTGGGAATGCTAGC 
AAATCTGGAGCATCAAACAAAAAACCCGAGATCGCTCAAGTAACTTCAGCATA 
TGCTGCTTCAGGGACTGAGCAGCTCAGCCTTGCGCCAGGACAGTTAATATTAA 
TCTTAAAGAAAAACACAAGCGGGTGGTGGCAAGGAGAGCTACAGGCCAGAGG 
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GAAGAAACGACAGAAGGGATGGTTTCCTGCCAGCCATGTAAAGCTGCTAGGT 
CCAAGCAGTGAAAGAACCATGCCTACTTTTCACGCTGTATGTCAAGTGATTGC 
TATGTATGACTACATGGCGAATAACGAAGATGAGCTCAATTTCTCCAAAGGAC 
AGCTGATTAATGTTATGAACAAAGATGACCCTGACTGGTGGCAAGGAGAAAC 
5 CAATGGTCTGACTGGTCTCTTTCCTTCAAACTATGTTAAGATGACAACAGACTC 
AGATCCAAGTCAACAGTGGTGTGCTGACCTCCAAGCCCTGGACACAATGCAGC 
CTACGGAGAGGAAGCGACAGGGCTACATTCACGAGCTCATTCAGACAGAGGA 
GCGGTACATGGACGACCTGCAACTTTTTGAACAAAAAACTCTCCTTTGAGGGC 
CTGGGGAAGCCAGAACCAGGGGAGCTGCCCACAAGGCTGGGTCTAAAGACAG 

10 ATTTTGCTCTCCCAGGACAGAGGAGCATCACATCGGCTTCATCCATCCAAACA 
AGCCACACTCGCTGGGCCTGGTATTTTATTGCACCACTAAAATTGCTAGCAAT 
CTATGCAAACATGATCTTTTTAAACAAACGCCACAGCACAGTGCCTTGTACTA 
GTGTTAACCTGTTCAGCTGTGTTAGATGCCAGGGTTTCCATTTTCAGGGCTATA 
AAAGTATTATGTGGGAAATGAGACATCAGACCACCGGACGTTACCACTTGGCA 

15 AATCTGTCCACTGTGGAGTTGGTGATGTTGGAACCATTCCACACTATGTGACC 
TCTGCTGGGTCACACACTCAGGAGGTGAAGGGCTGAGATGAAATGCTGCAGC 
CTTGGGGCTTGTGCAGCCTGATACTGAAATAGCATCCACTTGTGCACTGAATA 
AATAGAAACTTGATCGTTTTATTCTGACTAGATATTATCATTCTCTGCTAAGAC 
AATATAGTTTGAAATATTATAGTTTGAATATAAGGAGGAAAGCTTGATGTACT 

20 TTAAATATACTGTGAACTCTAATAATGTGGGGATATTTTTCAACTTTAATTTTC 
TTAAGTATAAATTATTTATGTAAATTCTTTGTTTTGCATATTTCATAGAACATG 
CATCTTTAAGCTTTATCATTGCCAACAATGTACAGAAAGAGAATAAAAGTATA 
AGTTTATGAATGTAAAAAAAAAAAAAAAAAAAA 



25 Mouse Ese2 coding sequence - Sequence ID NO:5 



ATGGCTCAGTTTCCCACAGCGATGAATGGAGGGCCAAATATGTGGGCTATTAC 
ATCTGAAGAACGTACTAAGCATGATAAACAGTTTGATAACCTCAAACCTTCAG 
GAGGTTACATAACAGGTGATCAAGCCCGTACTTTTTTCCTACAGTCAGGTCTG 
30 CCGGCCCCGGTTTTAGCTGAAATATGGGCCTTATCAGATCTGAACAAGGATGG 
GAAGATGGACCAGCAAGAGTTCTCTATAGCTATGAAACTCATCAAGTTAAAGT 
TGCAGGGCCAACAGCTGCCTGTAGTCCTCCCTCCTATCATGAAACAACCCCCT 
ATGTTCTCTCCACTAATCTCTGCTCGTTTTGGGATGGGAAGCATGCCCAATCTG 
TCCATTCATCAGCCATTGCCTCCAGTTGCACCTATAGCAACACCCTTGTCTTCT 
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GCTACGTCAGGGACCAGTATTCCTCCCCTAATGATGCCTGCTCCCCTAGTGCCT 
TCTGTTAGTACATCCTCATTACCAAATGGAACTGCCAGTCTCATTCAGCCTTTA 
TCCATTCCTTATTCTTCTTCAACATTGCCTCATGCATCATCTTACAGCCTGATG 
ATGGGAGGATTTGGTGGTGCTAGTATCCAGAAGGCCCAGTCTCTGATTGATTT 
5 AGGATCTAGTAGCTCAACTTCCTCAACTGCTTCCCTCTCAGGGAACTCACCTAA 
GACAGGGACCTCAGAGTGGGCAGTTCCTCAGCCTTCAAGATTAAAGTATCGGC 
AAAAATTTAATAGTCTAGACAAAGGCATGAGCGGATACCTCTCAGGTTTTCAA 
GCTAGAAATGCCCTTCTTCAGTCAAATCTCTCTCAAACTCAGCTAGCTACTATT 
TGGACTCTGGCTGACATCGATGGTGACGGACAGTTGAAAGCTGAAGAATTTAT 
1 0 TCTGGCGATGCACCTC ACTGAC ATGGCC AAAGCTGGAC AGCC ACT ACC ACTG A 
CGTTGCCTCCCGAGCTTGTCCCTCCATCTTTCAGAGGGGGAAAGCAAGTTGAT 
TCTGTTAATGGAACTCTGCCTTCATATCAGAAAACACAAGAAGAAGAGCCTCA 
GAAGAAACTGCCAGTTACTTTTGAGGACAAACGGAAAGCCAACTATGAACGA 
GGAAACATGGAGCTGGAGAAGCGACGCCAAGTGTTGATGGAGCAGCAGCAGA 
1 5 GGGAGGCTGAACGCAAAGCCCAGAAAGAGAAGGAAGAGTGGGAGCGGAAAC 
AGAGAGAACTGCAAGAGCAAGAATGGAAGAAGCAGCTGGAGTTGGAGAAAC 
GCTTGGAGAAACAGAGAGAGCTGGAGAGACAGCGGGAGGAAGAGAGGAGAA 
AGGAGATAGAAAGACGAGAGGCAGCAAAACAGGAGCTTGAGAGACAACGCC 
GTTTAGAATGGGAAAGACTCCGTCGGCAGGAGCTGCTCAGTCAGAAGACCAG 
20 GGAACAAGAAGACATTGTCAGGCTGAGCTCCAGAAAGAAAAGTCTCCACCTG 
GAACTGGAAGCAGTGAATGGAAAACATCAGCAGATCTCAGGCAGACTACAAG 
ATGTCCAAATCAGAAAGCAAACACAAAAGACTGAGCTAGAAGTTTTGGATAA 
ACAGTGTGACCTGGAAATTATGGAAATCAAACAACTTCAACAAGAGCTTAAGG 
AATATCAAAATAAGCTTATCTATCTGGTCCCTGAGAAGCAGCTATTAAACGAA 
25 AGAATTAAAAACATGCAGCTCAGTAACACACCTGATTCAGGGATCAGTTTACT 
TCATAAAAAGTCATCAGAAAAGGAAGAATTATGCCAAAGACTTAAAGAACAA 
TTAGATGCTCTTGAAAAAGAAACTGCATCTAAGCTCTCAGAAATGGATTCATT 
TAACAATCAGCTGAAGGAACTCAGAGAAAGCTATAATACACAGCAGTTAGCCC 
TTGAACAACTTCATAAAATCAAACGTGACAAATTGAAGGAAATCGAAAGAAA 
30 AAGATTAGAGCAAATTCAAAAAAAGAAACTAGAAGATGAGGCTGCAAGGAAA 
GCAAAGCAAGGAAAAGAAAACTTGTGGAGAGAAAGTATTAGAAAGGAAGAAG 
AGGAAAAGCAAAAACGACTCCAGGAAGAAAAGTCACAGGACAAAACTCAAGA 
AGAGGAACGAAAAGCTGAGGCAAAACAAAGTGAGACAGCCAGTGCTTTGGTG 
AATTACAGAGCACTGTACCCTTTTGAAGCAAGAAACCATGATGAGATGAGTTT 
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TAGtTCTGGGGATATAATTCAGGTTGATGAAAAAACTGTAGGAGAGCCTGGTT 
GGCTTTATGGTAGTTTTCAGGGAAAGTTTGGCTGGTTCCCCTGCAACTATGTA 
GAAAAAGTGCTGTCAAGTGAAAAAGCTCTGTCTCCTAAGAAGGCCTTACTTCC 
TCCTACAGTGTCTCTCTCTGCTACCTCAACTTCTTCCCAGCCACCAGCATCAGT 

5 GACTGATTATCACAATGTATCCTTCTC AAACCTT ACTGTTAAT AC AACATGGC A 
GCAGAAGTCAGCTTTTACCCGCACTGTGTCCCCTGGATCTGTGTCCCCCATTCA 
CGGACAGGGGCAGGCTGTAGAAAACCTGAAAGCCCAGGCCCTTTGTTCCTGG 
ACGGCAAAGAAGGAGAACCACCTGAACTTCTCAAAGCACGACGTCATCACTGT 
CCTGGAGCAGCAGGAAAACTGGTGGTTTGGGGAGGTGCACGGAGGAAGAGGA 

1 0 TGGTTCCCCAAGTCTTATGTCAAGCTCATTCCTGGGAATGAAGTACAGCGAGG 
AGAGCCAGAAGCTTTGTATGCAGCTGTGACTAAGAAACCTACCTCCACAGCCT 
ATCCAGTTACCTCCACAGCCTATCCAGTTGGAGAAGACTACATTGCACTTTATT 
CATACTCAAGTGTAGAGCCCGGGGATTTGACTTTCACTGAAGGTGAAGAAATT 
CTAGTGACCCAGAAAGATGGAGAGTGGTGGACAGGAAGTATTGGAGAGAGAA 

1 5 CTGGAATCTTCCCGTCCAACTACGTCAGACCAAAGGATCAAGAGAATTTTGGG 
AATGCTAGCAAATCTGGAGCATCAAACAAAAAACCCGAGATCGCTCAAGTAAC 
TTCAGCATATGCTGCTTCAGGGACTGAGCAGCTCAGCCTTGCGCCAGGACAGT 
TAATATTAATCTTAAAGAAAAACACAAGCGGGTGGTGGCAAGGAGAGCTACA 
GGCCAGAGGGAAGAAACGACAGAAGGGATGGTTTCCTGCCAGCCATGTAAAG 

20 CTGCTAGGTCCAAGCAGTGAAAGAACCATGCCTACTTTTCACGCTGTATGTCA 
AGTGATTGCTATGTATGACTACATGGCGAATAACGAAGATGAGCTCAATTTCT 
CCAAAGGACAGCTGATTAATGTTATGAACAAAGATGACCCTGACTGGTGGCAA 
GGAGAAACCAATGGTCTGACTGGTCTCTTTCCTTCAAACTATGTTAAGATGAC 
AACAGACTCAGATCCAAGTCAACAGTGGTGTGCTGACCTCCAAGCCCTGGACA 

25 CAATGCAGCCTACGGAGAGGAAGCGACAGGGCTACATTCACGAGCTCATTCA 
GACAGAGGAGCGGTACATGGACGACCTGCAACTTTTTGAACAAAAAACTCTCC 

TTTGA 

30 
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mouse Ese2 - Sequence ID NO: 6 

MAQFPTAMNGGPNMWAITSEERTKHDKQFDNLKPSGGYITGDQARTFFLQSGLP 
5 APVI.AEIWALSDLNKDGKMDQQEFSIAMKLIK1,KLQGQQLPVVLPPIMKQPPMFS 
PLISARFGMGSMP^SmQPLPPVAPIATPLSSATSGTSIPPLMMPAPLVPSVSTSSL 
PNGTASLIQPLSIPYSSSTLPHASSYSLMMGGFGGASIQKAQSLIDLGSSSSTSSTAS 
LSGNSPKTGTSEWAVPQPSRLKYRQKFNSLDKGMSGYLSGFQARNALLQSNLSQT 
QLATIWTLADIDGDGQLKAEEFILAMHLTDMAKAGQPLPLTLPPELVPPSFRGGK 

] 0 QVDSWGTLPSYQKTQEEEPQKKLPVTFEDKRKANYERGNMELEKRRQVLMEQ 
QQREAERKAQKEKEEWERKQRELQEQEWKKQLELEKRLEKQRELERQREEERR 
KEIERREAAKQELERQRRLEWERLRRQELLSQKTREQEDIVRLSSRKKSLFfLELEA 
VNGKHQQISGRLQDVQIRKQTQKTELEVLDKQCDLEIMEIKQLQQELKEYQNKLI 
YLVPEKQLLNERIKNMQLSNTPDSGISLLHKKSSEKEELCQRLKEQLDALEKETAS 

] 5 KXSEMDSFNNQLKELRJESYNTQQLALEQLFTKIKRDKLKEIERXRLEQIQKKKLED 
EAARKAKQGKENLWRESIRKEEEEKQKRLQEEKSQDKTQEEERKAEAKQSETAS 
ALVNYRALYPFEARNHDEMSFSSGDIIQVDEKTVGEPGWLYGSFQGKFGWFPCN 
YVEKVLSSEKALSPKKALLPPTVSLSATSTSSQPPASVTDYFINVSFSNLTVNTTWQ 
QKSAFTRTVSPGSVSPmCKJGQAVENLKAQALCSWTAKKENHLWSKHDVITVLE 

20 QQENWWFGEVHGGRGWFPKSYVKLIPG^VQRGEPEALYAAVTKKPTSTAYPV 
TSTAYPVGEDYIALYSYSSVEPGDLTFTEGEEILVTQKDGEWWTGSIGERTGIFPS 
>TYVRPKI)QENFGNASKSGASNKKPEIAQVTSAYAASGTEQLSLAPGQLILILKKNT 
SGWWQGELQARGKKRQKGWFPASITVKLLGPSSERTMPTFHAVCQVIAMYDYM 
AN>ffiDELNFSKGQLlNVMNKDDPDWWQGETNGLTGLFPSNYVKMTTDSDPSQQ 

25 WCADLQALDTMQPTERKRQGYIHELIQTEERYMDDLQLFEQKTLL 

Mouse Ese2 alternative transcript partial cDNA sequence - Sequence ID NO:7 

CCGTCTTCCACATTTCCCACATTGATCGTGTGTACACACTCCGAACAGACAAC 
3 0 ATCAACGAGAGGACGGCCTGGGTCC AGAAGATCAAGGGTGCCTC AGAGCAGT 
ACATCGACACTGAGAAGAAGAAACGGGAAAAGGCTTACCAAGCCCGTTCTCA 
AAAGACTTCAGGTATTGGGCGTCTGATGGTGCATGTCATTGAAGCTACAGAAT 
TAAAAGCCTGCAAACCAAACGGGAAAAGTAATCCATACTGTGAAGTCAGCAT 
GGGCTCCCAAAGCTATACCACCAGGACCCTGCAGGACACACTAAACCCCAAGT 
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GGAACTTCAACTGCCAGTTCTTCATCAAGGATCTTTACCAGGACGTTCTGTGTC 
TCACTATGTTTGACAGAGACCAGTTTTCTCCAGATGACTTCTTGGGTCGTACTG 
AAGTTCCAGTGGCAAAAATCCGAACAGAACAGGAAAGCAAAGGCCCCACCAC 
CCGCCGACTACTACTGCACGAAGTCCCCACTGGAGAAGTCTGGGTCCGCTTTG 
5 ACCTGCAACTTTTTGAACAAAAAACTCTCCTTTGAGGGCCTGGGGAAGCCAGA 
ACCAGGGGAGCTGCCCACAAGGCTGGGTCTAAAGACAGATTTTGCTCTCCCAG 
GACAGAGGAGCATCACATGGCTTCATCCATCAAACAGCCACACTCGCTGGGCC 
TGTATTTTATTGCACACTAAATTGCTAGCAATCTATGCAAACATGATCTTT 



10 Mouse Ese2 alternative partial protein containing C2 membrane-binding domains - 
Sequence ID NO:8 

WHISH1DRVYTLRTDNINERTAWVQKIKGASEQYIDTEKKKREKAYQARSQKTS 
G1GRLMVHVIEATELKACKPNGKSNPYCEVSMGSQSYTTRTLQDTLNPKWNFNC 
1 5 QFFIKDLYQDVLCLTMFDRDQFSPDDFLGRTEVPVAKJRTEQESKGPTTRRLLLHE 
VPTGEVWVRFDLQLFEQKTLL 

Novel Ese-coiled-coil interacting clones: 

Mouse homoloeue of C07E3 .1 protein (clone 65V - Sequence ID NQ:9 

20 

GAATTCGGCACGAGGGCTGAGAGAAGCGGACTCCGAGGACTCTGATGCTGAA 
GAGAAGCCTGTTAAGCAGGAGGACTTCCCGAAGATTTAGGACCAAAGAAGTT 
AAAGACGGGTGGCAATTTTAAGCCCAGCCAGAAAGGCTTTTCAGGAGGAACC 
AAGTCCTTCATGGACTTTGGCAGCTGGGAGAGACACACGAAAGGGATCGGGC 

25 AGAAGCTGCTGCAGAAGATGGGCTACGTCCCTGGGCGTGGCCTGGGGAAGAA 
CGCACAGGGGATCATCAACCCCATCGAAGCCAAACAGAGAAAAGGCAAGGGA 
GCCGTGGGGGCCTATGGCTCGGAGAGGACCACTCAGTCTCTGCAGGACTTCCC 
CGTGGCCGACTCGGAAGAGGAGGCAGAAGAGGAGTTTCAGAAGGAGCTGAGC 
CAATGGAGGAAAGACCCCAGCGGGAGCAAGAAGAAGCCAAAGTACTCTTACA 

3 0 AGACTGTGG AGGAGCTGAAGGCC AAGGGC AGGGTC AGCAAGAAGCTC AC AGC 
ACCTCAGAAGGAACTGTCTCAGGTCAAGGTGATCGACATGACAGGCCGGGAG 
CAGAAGGTGTACTACAGCTACAGCCAAATCAGCCACAAGCACAGCGTGCCCG 
ATGAAGGGGTGCCATTGCTGGCGCAGCTGCCCCCCACAGCCGGCAAGGAAGC 
CAGGATGCCGGGCTTTGCACTGCCTGAGCTGGAGCACAACCTGCAGCTGCTCA 
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TTGAGCGCACGGAGCAGGAGATCATCCAGAGCGACCGGCAGCTCCAGTATGA 
GCGGGACATGGTGGTCAGCCTGTCGCATGAGCTGGAGAAGACGGCCGAGGTT 
CTTGCACATGAGGAGCGTGTCATCTCTAACCTCAGCAAGGTGCTGGCCCTGGT 
GGAGGAATGTGAGCGCCGCATGCAGCCCCATGGCACCGACCCCCTCACTCTGG 
5 ATGAGTGTGCCCGCATCTTTGAGACACTACAGGACAAGTATTATGAGGAGTAC 
CGCCTGGCGGACCGCGCAGACCTCGCTGTGGCCATTGTCTACCCGCTCGTGAA 
GGACTACTTTAAGGATTGGCACCCCTCGAGGG 



Mouse partial C07E3. 1 protein (clone 65): - Sequence ID NO: 10 

10 

GTKSFMDFGSWERHTKGIGQKLLQ 

AVGAYGSERTTQSLQDFPVADSEEEAEEEFQKELSQWRKDPSGSKKKPKYSYKT 
VEELKAKGRVSKKLTAPQKELSQW 

PLLAQLPPTAGKEARNlPGFALPELEHr^QLLrERTEQEIIQSDRQLQYERDM 
1 5 SHELEKTAE\nLAHEERVlSNLSKVLALVEECERRMQPHGTDPLTLDECARIF^ 
DKYYEEYRLADRADLAVAJVYPLVKDYFKDWHPSR 

Mouse Novel cDNA clone 42/ Est accession #W29719 and #AA915044: - Sequence ID 
NO:ll 

20 

CATGGCGGCGGCTGCGGAGGGCGTCCCGGCGACGCGACGGAGGACGAGCCAC 
CTCGAGATGATGCTGCGGTGGAGACAGCCGAGGAAGCAAAGGAGC 

Mouse Novel cDNA clone 70: - Sequence ID NO: 12 

25 

CTTGAGTCTACTGAAAATACCCTGCAGGAAGCTACATCATCCATGTCTTTGAT 

GACCCAATTTGAACAGGAAGTATCTGGCCTCCAAAGACCATACGTGATATTGA 

GACTAGCGAAGAGATGC 

30 Mouse Novel cDNA clone 83/Est accession #AA589041 and W98708: - Sequence ID 
NO: 13 
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GAATTCGGCACGAGGGAGTCTGGTTCTGGAAAGCCGACAGAAGCTGAGCTTG 
TCAACTTAGATTTCTTGGGAGATTTGGATGTTCCGGTATCTGCCCCACCCCTGT 
GTGTCTGAGCTCGAGTCTCTCTGCTGGACTATGG 
Novel Ese-SH3 interacting clones: 

Mouse YNK1 partial cDNA: - Sequence ID NO: 14 

CTTTACGAGCAGAGGGAGCCAAATTCAGAGCCGTTTTAGATAAAGCTGTGCAA 

GCGGATGGACAGGTGAAGGAGCGCTACCAGTCCCATCGAGACACCATCGCAC 

TTCTGTGTAAGCCGGAGCCAGAGCTGAATGCTGCCATCCCCTCTGCTAACCCA 

GCAAAGACCATGCAGGGCAGCGAGGTTGTAAGTGTCTTAAAGTCCTTATTATC 

AAATCTTGATGAAATCAAGAAGGAAAGAGAGAGTCTTGAGAATGACCTGAAG 

TCAGTGAATTTTGACATGACAAGCAAGTTTTTGACAGCTCTGGCCCAAGATGG 

CGTGATAAATGAGGAGGCTCTCTCTGTCACTGAGCTGGATCGGATCTATGGCG 

GTCTAACAAGTAAAGTTCAAGAGTCTCTGAAGAAACAAGAGGGACTTCTAAAA 

AATATACAGGTCTCACACCAAGAATTCTCCAAAATGAAGCAATCTAACAACGA 

GGCTAACTTGAGAGAAGAAGTTCTGAAGAACCTAGCAACTGCGTATGACAACT 

TTGTTGAGCTTGTAGCTAACTTGAAGGAGGGCACAAAGTTTTACAATGAGCTG 

ACTGAGATCCTGGTCAGGTTCCAGAACAAATGCAGTGACATAGTGTTTGCACG 

GAAGACAGAAAGAGACGAGCTCTTGAAGGATCTGCAGCAGAGCATTGCCAGA 

GAGCCCAGCGCTCCTTCAATCCCTCCTCCAGCCTATCAGTCCTCCCCAGCAGC 

GGGGCATGCAGCAGCGCCTCCAACTCCAGCCCCAAGAACCATGCCGCCTGCTA 

AGCCCCAGCCTCCAGCCCGGCCTCCACCTCCTGTGCTTCCTGCAAACCGAGTT 

CCTCCTGCTTCTGCTGCTGCTGCCCCTGCAGGCGTGGGGACGGCTTCAGCAGC 

GCCGCCACAGACCCCTGGCTCTGCTCCCCCGCCACAGGCTCAGGGACCACCAT 

ACCCTACCTATCCAGGATATCCCGGGTATTGCCAAATGCCCATGCCCATGGGC 

TACAACCCCTACGCATATGGCCAGTACAATATGCCGTACCCACCGGTGTATCA 

CCAGAGCCCCGGACAGGCTCCATACCCAGGACCCCAGCAGCCTACCTACCCCT 

TCCCTCAGCCCCCGCAGCAGTCCTACTATCCACAGCAGTAACGCTGCCACGTG 

CTGCTGGTTCAGATCAGAGCGACAGGACAGCAGCTGCCACCAGCTCTAAGCCA 

CGCTCTGGCCACTCGAGAGTATCTTGCTCTATTGATTGCTGTGGATGATTTCTG 

TCTGTGGCTAAAGCCGAAGGCTGGGCCCCACCTCCACATTTGATCGCACTCGT 

GAGATTCTGCTGCTGTTGCAGTATAAACGCTAGCTATAATAGCATTTGAAAAA 

AATTACAGTTCCATAAAATGCTGAAAATGAGAAATTAAACCTGCAAGTGAAAC 
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ATTTGAAATTAGCATACTTTATAAGATGCAGTTGGGACAAAGATGGCTTAAGT 
ACTGATATTTAAGGAAAAAGTTTTCTTTCTCTTTTGGTTTATTGATTTAGTTTA 
ATTTCTATTATGATATTTTGCATAATCAAGGCATTGTAAATCTTATAATTTAAA 
AATAAATTACTTACGAACAGTTGTCATTGTTATGTTTTGTCATTGATTCTCATT 
5 GCTGTCTAGTTCCTTTCTGGTATTAGCCTCTCCTTCTGTATGTTCACAGGCTCC 
ATTACTGTGTTGAATTGCGTGACGTCAGGTGAGCAGTCAGGGAGGGCTGCTCT 
GCGGACGCCAAGCGCACACCAGCTTGTCTCAGGCTCAGCAGTCAGCTCATCTG 
GACATTTCTATTTAAAAGTCCTTTAATGTGGAAGATACACACAATTGTTACCA 
AAGGTTCTTCCAATTAATTTTACAATTTAAAAAGTATGTATTAATG'l'ri"rA'r'l'G 

1 0 TTAGATTTTCCAAAAAAATGATGCAAATTCTGGTAATATTCATTTCCCTCACCC 
ATAATTTGGTTAAAATGAGTAGTTTTAGCCATACAGTCTCATCTGCTGTGGAG 
GAACCTGGAGAAAGTCCCCTGTGCCTTTCTAGCCCTTGGGTTCTATTCTTATCC 
TGCAATGTCTACTGCACAGTGTGTTTGAGCAGATCCTAACCCTCCTTTTACAGT 
TTCTTCTTCTTACTTCTTTATTCTTTTTGTGGCTCCTGAAATCTGAGGTTATTTT 

15 GTAATTCAGGAGCATGCAGGACAATTGTTGGGACATGTGCCTAGTCCGGAATA 
C AGCCC AGG AC AGC AAGGAG ATGC GTCCT GC AC C AGG AAGCCGTGC A GGC AG 
GAGCTGTCCAAGGTCCCGGCGGCTCTGCCTGTGTGAGGCAGGAGAATGAGCA 
GATTCCCTAATCTATGTTCTCGAAGTTTAATGCTGATGTTGTCTTGCCTTATCC 
TCATTTAACTGATACTGTCACCCAGTCCACCTTTGCTCTCATTGCAAAGTGATA 

20 GTGTAATTTCAAATGTAAGACTGAAGATACGATTGTAAAAGGGAGTAAACTGG 
TTTAAACGTGTTATTCTAAAGCACCTTACTTTGTTGTTGTATGCAGAAAACACA 
GATGCGCTAATTCAGTATAAATGACTGATTGCCTGGAATTTGGACGTTGGCTT 
AAAGTCCGATAGCTAAACCTTGGCAAAACATAACAAACATTTCATTGCTCAGC 
CTCAGTGCTCTGGAGTATTCAGTGTATGAGACAGGTTTATTTGAGTCCTCTGTA 

25 AATGGCATTTGAATTTTATATTCTCCCCTCCCGAGTATCTTATAAGACATCCCC 
TGAGTTAGGGAGTTCCCAGACTGCTACTCTATTCCTTATGAATGCAAAACAAC 
CACCAATAGAACAAAAAAAAAAAAAAAAACTCGAG 

Mouse YNK1 partial coding cDNA - Sequence TD NO:l 5 

30 

CTTTACGAGCAGAGGGAGCCAAATTCAGAGCCGTTTTAGATAAAGCTGTGCAA 
GCGGATGGACAGGTGAAGGAGCGCTACCAGTCCCATCGAGACACCATCGCAC 
TTCTGTGTAAGCCGGAGCCAGAGCTGAATGCTGCCATCCCCTCTGCTAACCCA 
GCAAAGACCATGCAGGGCAGCGAGGTTGTAAGTGTCTTAAAGTCCTTATTATC 
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AAATCTTGATGAAATCAAGAAGGAAAGAGAGAGTCTTGAGAATGACCTGAAG 

TCAGTGAATTTTGACATGACAAGCAAGTTTTTGACAGCTCTGGCCCAAGATGG 

CGTGATAAATGAGGAGGCTCTCTCTGTCACTGAGCTGGATCGGATCTATGGCG 

GTCTAACAAGTAAAGTTCAAGAGTCTCTGAAGAAACAAGAGGGACTTCTAAAA 

AATATACAGGTCTCACACCAAGAATTCTCCAAAATGAAGCAATCTAACAACGA 

GGCTAACTTGAGAGAAGAAGTTCTGAAGAACCTAGCAACTGCGTATGACAACT 

TTGTTGAGCTTGTAGCTAACTTGAAGGAGGGCACAAAGTTTTACAATGAGCTG 

ACTGAGATCCTGGTCAGGTTCCAGAACAAATGCAGTGACATAGTGTTTGCACG 

GAAGACAGAAAGAGACGAGCTCTTGAAGGATCTGCAGCAGAGCAl TCjCCJAGA 

GAGCCCAGCGCTCCTTCAATCCCTCCTCCAGCCTATCAGTCCTCCCCAGCAGC 

GGGGCATGCAGCAGCGCCTCCAACTCCAGCCCCAAGAACCATGCCGCCTGCTA 

AGCCCCAGCCTCCAGCCCGGCCTCCACCTCCTGTGCTTCCTGCAAACCGAGTT 

CCTCCTGCTTCTGCTGCTGCTGCCCCTGCAGGCGTGGGGACGGCTTCAGCAGC 

GCCGCCACAGACCCCTGGCTCTGCTCCCCCGCCACAGGCTCAGGGACCACCAT 

ACCCTACCTATCCAGGATATCCCGGGTATTGCCAAATGCCCATGCCCATGGGC 

TACAACCCCTACGCATATGGCCAGTACAATATGCCGTACCCACCGGTGTATCA 

CCAGAGCCCCGGACAGGCTCCATACCCAGGACCCCAGCAGCCTACCTACCCCT 

TCCCTCAGCCCCCGCAGCAGTCCTACTATCCACAGCAGTAA 

Mouse YNK1 partial protein - Sequence ID NO: 16 

LR AEG AKFRA VLDK A VQ ADGQVKERYQSHRDT1 ALLCKPEPELN AAIP S ANP AKT 

MQGSEWSVLKSLLSNLDEIKKERESLENDLKSVNFDMTSKFLTALAQDGVINEE 

ALSVTELDRIYGGLTSKVQESLKKQEGLLK>J1QVSHQEFSKMKQSKNEANLREEV 

LKNLATAYDNFVELVANLKEGTKFYNELTEILVRFQNKCSDIVFARKTERDELLK 

DLQQSIAREPSAPSIPPPAYQSSPAAGHAAAPPTPAPRTMPPAKPQPPARPPPPVLP 

ANRVPPASAAAAPAGVGTASAAPPQTPGSAPPPQAQGPPYPTYPGYPGYCQMPM 

PMGYNPYAYGQYNMPYPPVYHQSPGQAPYPGPQQPTYPFPQPPQQSYYPQQ 

Mouse novel cDNA clone 4: - Sequence ID NO: 17 

GGTCTTGGCTAGAATTTTAAATTTCTTCTCATTTGAGTAAAATGTTGCATTCTG 
AAGTCCCATGCTACCTGAAGTTGCATTTGGAGTCCCAAGCTACTGGAATGTTT 
ATATGTGACCGTTTCCCAGGAGGCTTACACTGCAGAAGGAAGAATGAATCTAG 
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GTGAGGTGGGCAGCTGCTTGGCAGTCCTCTCTTGTGCCCCAACTGTAAACCAG 
ATAGAAATGTTCAGGGGAGGATACTTTCATTATTGTGGTTTGTAGTGTTAAGA 
TGATTGCTTCTGCCTTGGAAATACCTCAAGCTGTTCTTATTTAACAGGTAAGTG 
ACTGAGTATAATATTCCAGAAAAATTTGAAATCCTAATTTCTTCCATATTTCAT 
5 TAAATTTTTTGCATACAGGTCTAACAAATATGGATATGTATACACATCCTCTTT 
AATGAAGGTATTATTTTGGTTACTTTTCCTAAGATATACCTTAAAAGATGTTCT 
ATACATTTCCTACTTAAATTCTGGGGGATTTGGAGTATGTACATGATAAAAAA 
GATTATAATATATCGATTGAAGTTACTTTATTTTCTAATTAGAATTATTTTAAT 
AGTCCTTTATTGAATAAGTGCTGTAATTTGTTTGCTATGAGACTTATlCCTGAr 

1 0 GTGAATGTAAATTATTTTTCCACATGCATGAAAAAATGTATGTACTAATCAGA 
GTTGTCTCCATTGCATTGAAATTACTTGTTTTGAACTAAAGTAACTCATATTTA 
TGTAGTAGAATGCTTATGTTTTCAGACTTTGTAATGATTTCCTTTGGATGTATT 
TTAAATCAATCGGTCTGGGTAACATATCAGTTTAGATTAATATGTGCTTAAAA 
GAAGAAAAAAATTCAATGGTTCATAGTAGAAATGTGCCACACTTAAATAAGCT 

1 5 CTGTATGACATGAAATTCTGTTAAAACATTGTAATTCATGGTGACTTTTAACTT 
ATAAAAATACTACTTGCACGGGTTACTTGATTTATGGATATATGAAAACTTCTC 
AGGACGAAAGTTCTTCTTTCTCTAGAACTATTCTTCTGTCGGTCATGCAGAATG 
CTGTTATTCTGAAAAGTGTCCCTGTTGCATATGATGGTCACTTTATTTGGGGGG 
ATTCTTCATAAGATGTGAGATGTTGATGCCAGTCTTTCCCAAGTAAGTGCTCGT 

20 AAAAAAGGACTACTAACTAGCCTGCATCTGTCTCTAACTGGGACCAAGGGGTC 
TGCTGAAGGAAACTGAAGAGCTCTAACATTTTCACAGCTTGGAGAAGATAGAA 
TCTTTAAAAGTACAACTGAAGCTTGATCTATTTTACAAGTGCATTGATGGCCCC 
TGTCCTTCTCTGGTTCCTGTCATTTGAAACCAACTCCTGTTGTAAATAGGAAGA 
ATATGGGACATTCATATTTAAGAAAATTTGATGTCATTAGGTGACTAAGTAGA 

25 AGGCTTAGAAAAATGTATTCATTTGCAAGTATTTTGGCACAAGAAATTTTCCA 
ACTGAATAGTAAGCAAAAGCTAAGTTGTTTCATTGAAATCATAAGGCAGTTTA 
AGATAAACTGGAGAAGATAACTGTTCTAATAGAGGATAATCGAATTGATTGTC 
AAGTGGATGTTATTTATTGGATAGTGACAGAGTTTATTTGTAACCTTAATTATA 
TTAAAAGTTATTCTGTTAGGATGTTTTGTATTAATAAACGTGAACAAAATTAA 

30 AAAAAAAAAAAAAAAAACTCGAGGG 
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Mouse Novel cDNA clone 8/47/52: - Sequence ID NO: 1 8 

GAGAAGGCGGCCTGCCGCAGCGGGACAACCTAGAGCGCGACGTGGAGGCGCG 

TAGCGGAGCTGGAGCAACTGCGCACCGAGGTGGATGTGCGCATTAGCGCNNT 

GGACACCTGCGTCAAGGCCAAGTCGCTGCCAGCCGTCCCGCCGAGAGTCTCAG 

GCCCACCCCCGAACCCTCCACCCATTGATCCAGCTAGCCTGGAGGAATTCAAG 

AAAAGGATCCTGGAGTCTCAGCGGCTCCCTGTAGTCAACCCTGCWGCCCAACC 

CAGCGGTTGAGRACCCAGCTGCCGCAGGA(JU<J 1 UUO 1 UtUAUAA l c<^l,w\^ 

CTGTGGATGGGGGCAGCCAGGTGCCCACAGTGCTGGACACCCGCCGTGCCTG 

CCGGCAGCCTCCACCCCCAGCGCCTTCTCTGGCACCCCTTCACTGTCCCSTGCA 

TCCCCRCCATTCSSCASWSASKGGATTTAAGGCACACACAGCTGTGAGATGACT 

TCACATCGACCCCTTGTGCAGTGACCCGGATGGTGCCCCACCCACACATGAAG 

CACCCACAGCTCAGCTGCCACCCTAGGCAACTCCTCCGGTTTCCTATCACTCTG 

CTCCTGACCCGGGAGGTGAGAACAGGAAGCCCAGCCTTCAGCTCCCTTGGGAG 

TTTCCAGCCTCCCTCTTAAAGGCCACTAGGGTTTCCAGATCCTATTTGAGAGTC 

TCCAGGCCTCCCCTGAAGGGTTCTAGCCACCACGCCCACAGGATTCCCATTAG 

GTTTTAAAGTCTTTTCCAGAGTCCGCTGGTTCCCCTCCTCCTCACAAGGAAGGG 

CCTCAATTGTAGATGAGCGTTCCGGGTGGATCTTAGAGCCCTAGAGGGAGGCT 

TTTGCTTGTARCCCCCTAAAGATATTACTGGCACATAATAAATATGAAAGTCCT 

TTGAAA 

GTTGGACACTGCGCAAATGGGGCTCTCCATGGACCGCAGCCCATACGCCCGCA 

CGGGGGACCAGCAGCGCGGCTCTGGTTCTACCTGCGCTATTTCTTCCTCTTCGT 

GTCGCTCATTCAGTTCCTCATCATCCTGGGCCTGGTCCTCTTCATGATCTATGG 

CAATGTGCACGCCACCACTGAGTCCAGCCTGCGCGCCACGGAGATCCGCGCCG 

ACAGCCTGTACAGCCAGGTGGTTGGACTATCGGCCTCACAGGCTAACCTGAGC 

AAACAGCTGAACATCAGCTTGCTTGTCAAGGAAACAGTCATGCAGCAACTGTT 

GACTACGCGACGTGAGATGGAGCGCATCAACGCCAGCTTCCGCCAGTGCCAA 

GGCGACCTGATCACCTACATAAACTATAATCGCTTCATCGCCGCTATCATCCTG 

AGCGAGAAGCAGTGCCAGGAACAGCTGAAGGAGGTCAACAAGACCTGCGAAC 

TTTACTCTTCAAGCTGGGAGAAAAAGTTAAGACACTGGAGATGGAGGTGGCC 

AAGGAGAAGGCAGTGTGCTCCAAGGACAAGGAGAGCCTGCTGGCAGGAAAGC 

GGCAGACGGAAGAGCAGCTGGAGGCCTGTG 
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Mouse Novel cPNA clone 18/25: - Sequence ID NO: 19 
5 'end of partial clone 

5 TGTGCGCCGCCTCTAGAACTAGTGGATCCCCCGGGCCTGCAGGAATTCCGGCA 
CGACGGCCGAGCGCCGCGGACCACCCGCGGCTGCCCGCCGAGCCGTCGACAT 
GTGGGGGGACTGGGGTGGGAGCGGCCGGAGCAGCGCCAGGTACCCGGGCGC 
GCAGAACCATGGCTCTCGCTCGCCTGTCCTGACCTGGCTTGCTCGCCCCACCG 
AAGAATGTCAGCCAAGTCCAAGGGGAACCCICCICGI'CCICCGCAGCCGAGGG 

10 ACCGCCGGCAGCCTCCAAAACCAAGGTGAAGGAGCAGATCAAGATCATAGTG 
GAGGATCTGGAATTAGTCCTGGGCGACCTGAAGGACGTGGCCAAAGAACTTA 
AGGAGGTGGTTGACCAGATTGACACCCTGACCTCTGATCTACAGCTGGAAGAT 
GAGATGACCGACAGCTCCAAAACAGACACTCTGAACAGCAGCTCCAGTGGGA 
CAACAGCCTCCAGCATAGAGAAGATCAAAGAACAGGCCAATGCTCCCCTCATT 

15 AAACCTCCAGCACACCCGTCTGCTATCCTGACTGTCCTGAGAAAGCCAAACCC 
TCCACCGCCTCCTCCAAGGTTGACACCCGTGAGGTGTGAAGAGCCTCAGAGAG 
TGGTGCCGACTGCCAACCCTGTAAAGACCAATGGCACTCTTCTGCGGAATGGA 
GGCTT AGC GGGG AGGCCC AAC AAAATTCC AAATGG AG 

20 3' end - Sequence ID NO: 20 

CTCGAGTTTTTTTTTTTTTTTTTTTTTTTTTTTCATTATTTACTATTATTTATTGA 
CATATTTCCAAAGCTCAAAATATTTTATTATACATATAGTTGAACATATGTTTC 
AAATTGTATAGTATAGAAAATAAACTTTTTTGTAGTGTCCTCAGCATTTCATGA 
TGCAAAACTATTGACAAACATCTTTAGAAAAATAATAAAATAGTCCTTCGGTA 

25 TTAAAATTCTTATTAAAAAGCATTAGATCAAAGGGAGAACTATGACATCATCA 
ATGCATAGATGAGATAGGCATGAATGGAATGAGTTGCCCTGGCTTTATCAACA 
AATCAAAATATCTGACATCCCAGCTCTTATAATAGACCAAAATACTTGGAATC 
AGAAGGTCACAGTTTGTTTTAGGTCAATCACAAAAAAATAAAATTCATTCATA 
CTTTCTCAATTTTCCGCAGTTTCTGATGATGGAACATAGAAAACAATGTACGTC 

30 CAGGACAGAGGCGCTACTCTGCATACTTACCACGTGATTTTTTATGCCACTTTG 
TTGAATGCAGATTAATATATTTGGGCTTTTTATTGCTTGAGTAGAAAGTGCTCA 
TTACTTATTATTTTACGTTTATCATATAGAAAATTAAAAACAAACAGAACGTTT 
TCTTAAATGGCAGATATCACACTGTGGTAGTGGTGGATTTCCTCAGGATGGTC 
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TTCTGTGGTTTTGGTGCAGCGGGAGGAGGCACGGTTGCAGGTGTGGGAGGGG 
GGAAACTGTTACTGTGGCTTATTCCCAGTCCCCCATTTTCTAATGGGAAAT 

Mm.se Novel r-DNA clone 9.V F.ST acces sion #AA1 1995 1 - Sequence ID NO 21 

5 

GCACAGCCCCCCTCCATCCTGAAGAAAACCTCAGCGTATGGGCCTCCAGCTTC 
GGGCCGTGTCTATCCTTCCTCTCCTGGGACATGGTGTTCCCCGCTTGCCCCCCT 

GGCAGAAAACCG 
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TABLE 1 

Ese Proteins 

E S e 1 MAQFPTPFGGSLDWAITVE ERAKKDOOFLSLKPIAGFITGDOARNFFFO 

• : : : : : : : : : : : : : : : : : : ; : 
E S e 2 MAQF PTAMNG G PNMWA ITS EERTKHDKO FDNLKPSG5Y ITGDOARTFFLO 

EH1 SGLPQPVLAOIWALADMNNDGRMDn VEFSIAMKLI KLKLOGYOT.PfiTT.PP 100 

5GLPAPVLAEIWALSD LNKDGKMDCJO EF SIAMRL1 KI.KI.l x;i )O I , PVM/ I .MP TOTT 

VMKQQPVAISSAPA-FGIGGIASMP PLTAVAPV- - PMGSI PWGMSP 

: : : : :::: : : : : : : : • 

IMKQFPMFSPLISARFGMGSMPNLSIHQPLPPVAPIATPLSSATSGTSIP 

PLVSS VP PAAV P P LANG A P P VI Q PL PAFAH PAATW PKSSSFSRSGPG 191 

s: s: : : : :::: ; 

PLMMPAPLVPSVSTSSLPNGTASLIQPLS-IPYSSSTLPHASSYSLMMGG 199 

SQLNTKLQKAQSF- DVASAPPAA EWAV P QSSRLKY 

*•••• • * • S ' ! S ! S ! ! S ' 

FG-GASIQKAQSLIDLGSSSSTSSTASLSGNSPKTGTSEWAVP OPSRLKY 

EH2 ROLFNSHDKTMSGHLT GPOARTILMOS5LPOAOLASIWNL5SDIDQDGKLT 275 

• * * • • • • • * * * • • • • * • Z Z Z Z Z Z I z z z z z z z z 

ROKFNSLDKGMSGYLSGFOARNAL LOSNLSOTOLATIWTLADIDGDGOLK 2 96 

AEEFILAMHLIDVAMSROPT.PPVT.PPBVTPPCgPPPWPqnc^MWTgcccw 
:::::::::: : : ::::: : : : : : : : : : z : : : 

AEEFILAMHLTDMAKAGQPLPLTLPPELVPPSFR GGKQV DSV 

DQRLPEEPSSEDEQQPEKKLPVTFEDKKRENFERGSVELEKRRQALLEQQ 37 5 

:: ' • :::::::::: : : : : ::::::: : : : : 

NGTLPS YQKTQEEE- PQKKL PVTFEDKRKANYERGNMELEKRRQVLMEQQ 3 8 9 

RKEQERLAQLERAEQERKERERQEQEAKRQLELEKQLEKQRELERQREEE 
• » • i ' ::: : : : : : : : :::::: ::::::;r::::i: 

QREAERKAQKEKEEWERKQRELQEQEWKKQLELEKRLEKQRELERQREEE 

RRKEIERREAAKRELERQRQLEWERNRRQELLNQRNKEQEGTWLKARRK 475 
tttxtx ::::: :::::: : :;• • • s - 

RRKEIERREAAKQELERQRRLEWERLRRQELLSQKTREQEDI VRLSSRKK 4 8 9 
C-C TLEFELEALNDKKHQLEGKLQDIRCRLATQRQEIESTNKSRELRIAEITH 

• • • • * - • I Z Z Z Z Z Z Z mm m S • • S 

S LHLELEAVNGKHQQI SGRLQDVQI RKQTQKTELEVLDKQCDLEIMEI KQ 
LQQQLQESQQMLGRLIPEKQILSDQLKQVQQN--SLHRDSLLTLKRALEA 57 3 

• • • ••••• * m m m m » _ 

• ••••• • >■*■•• Z » Z m Z Z Z 

LQQELKEYQNKLI YLVPEKQLLNERIKNMQLSNTPDSGISLLHKKSS-E- 587 

KELARQQLREQLDEVERETRSKLQEIDVFNNQLKELREIHSKQQLQKQRS 
: • • i :::: : :: : : : : : :::::::::: r : : 

KEELCQRLKEQLDALEKETASKLSEMDSFNNQLKELRESYNTQQLALEQL 
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TABLE 1 Continued 
LEAARLKQKEQERKSLE-LEKQKED--AQRRV-QERDKQWLEHVQQEEQP 66 9 

********* * * • • * * • * * 

********* * • » • * • • * * 

HKIKRDKLKEIERKRLEQIQKKKLEDEAARKAKQGKEN LWRESIRKEEEE 6 67 
RPRKPHEEDRLKREDSVRKKEAEERAKPEMQDKQSRLFHPHQEPAKLATQ 

mi • a ■ a • a a • • • • 

KQK RLQEEKSQDKTQEEER- KAEA- -KQSET 

SH3A APWSTTKKRPr.TTSAnESV KWYYRALYPFESRSHDEITIOPGDIVMVDE 76 9 

* • ■ a • m m m • ♦ • 

• a a a a a a a 

ASALVNYRALYPFEARNHDEMSFS.SGDIIOVDE 74 8 



SOTGEPGWLGGELKGKTGWFPANYA EKI PENEVPTPAKPVTDLTS APAPK 

::::::: :::::::::: : : 
KTVGEPGWLYGSFOGKFGWFPCNYV EKVLSSE KALSPK 

LALRET PAPLPVTSSEPSTTPNNWADFSSTWPSSSNEKPETDNWDTWAAQ 86 9 

KALLPPTVSLSATST-SSQPPASVTDYHNV SFSNLTVNTTWQ 82 7 

PSLTVPSAGQLRQRSAFTPATATGSSPSPVLGQGEKVEGLQAQALYP]^£A 

m m m m m m • mm m m m * • 

* • a a • * • a* a • • • a • •••>» • • 

QKS AFTRTVS PGSV-SPI HGOGOAVENLKA QALCSWTA 

SH 3 B KKDNHLNFN KSDVITVLECAODMWWFGEVOGQKGWFPKSYVKLISGPVRKS 969 

mm * * * * * • *»*****•> *> ****** • ***•***•*»-»• * 

* * ***** • ********* * *********** • 

KKENHLNFSKHDVITVLEOOENWWFGEVHGGRGWFPKSYVK LI PGNEVQR 914 

T.qTiyrfiPTRSPASLKRVASPAAKPATP-GEEFI AMYTYESSEOGDLTFOO 

:: : : : : ::::::::::: 

GEPEALYAAVTKKPTSTAY PVTSTAY PVGEDYI ALYSYSSVE PGDLTFTE 

SH3C GDVIWTKKDGDWWTGTVGDKSGVFPSNYV RLKDSEGSGTAGKTGSLGKK 1068 

* * ** *** **•* * ■ «***•>** ** * * * * • *• 

* * ** **# ***• * * ******* *• • • • • • • • 

GEEI LVTOKDGEWWTGS I G ERTG I F PSNYV RPKDQENFGNASKSGASNKK 1014 

SH3D PKTAWTASYAATGPKOLTr.APnnT.TT.TRKKWPGGWWEGELOARGKKROI 
****** * * S " * * 2 *«••••*•» J 2 • 5 S 5 !*!#!IJi*'« 

PEIAOVT 5AYAASGTEQLSLAPGOLILILKKNTSGWWOGELOARGKKROK 

GWFPANYV KLLS PGTSKTTPTFr.PKTAVOPAVCOVl GMYDYTAONDDELA 1168 

: : : : : z z :::::: :*::::::: 

GWFPASHV KLLGPSSERTMPT FHAVCOVI AMYDYMANNEDELN 1107 

SH3E FRKGQI INVLNKEDPDWWKGEVSGOVGLFPSNYVKLTTDMDPSOO 1213 

* *•■■> * • IS Z • 5*5 • S 2 2 S 
FSKGOLINVMNKDDPDWWQGETNGLT GLFPSMYVKMTTDSDPSOQWCADL 1157 



QALDTMQPTERKRQG Y I HELI QTEERYMDDLQLFEQKTLL 1197 
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FIGURE 1 
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